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TECHNIQUES FOR ASSESSING NONSPECIFIC BINDING 
OF NUCLEIC ACIDS TO SURFACES 

Technical Field 

This invention relates generally to nucleic acid chemistry and hybridization assays. 
Particularly, the invention relates to methods for improved detection and analysis in 
nucleic acid hybridization assays. The invention provides reliable methods for estimating 
background signal deriving primarily from nonspecific hybridization. The invention is 
useful in chemical, biological, medical and diagnostic techniques, as well as for drug 
discovery. 

Background of the Invention 

In nucleic acid sequencing and analysis, there is a growing emphasis on the use of 
high density arrays of immobilized nucleic acid probes. Such arrays can be prepared by 
massively parallel schemes, e.g., using the selective photomask techniques described in 
U.S. Patent No. 5,143,854. Arrays constructed in this manner are typically formed in a 
planar area of between about 4-100 mm 2 , and can have densities of up to several hundred 
thousand or more distinct array members per cm 2 . 

In use, an array surface is contacted with one or more analytes under conditions 
that promote specific, high-affinity binding of the analyte molecules to one or more of the 
array members (probes). The goal of the procedure is to identify one or more position- 
addressable members of the array which bind to the analyte as a method of detecting 
analyte molecule(s). Typically, the analyte is labeled with a detectable label such as a 
fluorescent tag, to indicate the one or more array regions where analyte binding to the array 
occurs. A variety of biological and/or chemical compounds have been used as 
hybridization probes in the above-described arrays. See, generally, Wetmur, J. (1991) Crit 
Rev Biochem and Mol Bio 26:227. 

For example, such arrays can be used to perform nucleic acid hybridization assays. 
Generally, in such a hybridization assay, labeled single-stranded analyte nucleic acid (e.g. 
polynucleotide target) is hybridized to a complementary single-stranded nucleic acid 
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probe. The complementary nucleic acid probe binds the labeled target and the presence of 
the target polynucleotide of interest is detected. 

A common drawback of nucleic acid hybridization assays is the presence-of signals 
which are generated due to an undesirable interaction of various components used in a 
given assay, i.e. signal generated by entities other than due to hybridization of the analyte 
and the specified complementary probes, such as signal generated from (i) the reporter, i.e. 
a signal arising from the label itself when it is not attached to the target, such as a signal 
generated from a fluorescent dye used in labeling the target; (ii) the non-reporter, i.e. a 
signal generated from the substrate or other assay components, and (iii) signal generated 
due to non-specific binding of probes to labeled entities other than their specific target 
molecules, i.e. binding not related to hybridization of the analyte and the complementary 
probes. Background signal generated from any of these mechanisms will add to the total 
signal measured. Uncorrected signal containing background signal results in an 
overestimation of the "real" signal, which can lead to "false positive" results. Thus, the 
background signal needs to be estimated accurately and subtracted from the total signal of 
a hybridization assay to yield the "real" signal. 

However, accurate estimation of the background signal is complicated. 
Underestimation of the background signal will result in an overestimation of the "real" 
signal, which can yield "false positive" results. Conversely, overestimation of the 
background signal will result in an underestimation of the "real" signal, which can yield 
"false negative" results. Thus, background overestimation will negatively impact the 
lowest concentration of the target that can be reliably detected. An accurate estimate of the 
background signal is thus needed to generate accurate results. 

A common approach to correcting background signal in arrays is to evaluate the 
portion of the array that is outside of the probe features. However, the background 
correction problem is particularly complex for measurements made using arrays of nucleic 
acid hybridization probes, because background may vary as a function of location on the 
surface. Furthermore, the local properties of the surface that contains bound nucleic acid 
probes may be very different from the surrounding surface that does not contain bound 
probes. The "local background signal" is the signal generated from the portion of the array 
outside of the probe feature area. The signal from the local background immediately 
adjacent to a given feature is subtracted from the total signal of that feature to correct for 
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background and to yield the "real" signal. Alternatively, the local background signal from 
the entire array can be evaluated and a single value (e.g. an average local background 
signal or the minimum local background signal) can be calculated to correct all features of 
that array. This is referred to as the "global background signal". The choice of an 

5 appropriate background correction method depends critically upon which of these two 

influences, i.e., local background or modification of surface properties by covalently bound 
nucleic acid probes, is judged to most strongly influence background signal in the array 
regions containing covalently bound probe molecules. 

The use of local or global background correction methods are problematic. The 

10 properties of the array surface outside the features may differ from the properties of the 
array surface within the features. These differences can result in different levels of non- 
reporter signal or different levels of reporter non-specific binding. Thus, the observed 
signal from the local background or estimated from a global background calculation may 
result in an inaccurate estimation of the background signal within the feature. 

15 Additionally, the probes themselves may generate a portion of the background signal. For 
example, the bases or phosphodiester linkages of the probes may (i) produce non-reporter 
signal, (ii) bind to components which produce non-reporter signal, or (iii) non-specifically 
bind the reporter. Therefore, in these cases, using local background will underestimate the 
true background signal that should be subtracted. 

20 Representative methods for resolving the problem of interfering background 

signals in nucleic acid hybridization assays are described in U.S. Pat. Nos. 4,868,105; 
5,124,246; 5,563,034; and 5,681,702; WO 98/24933; Chen Y., et al., Journal of 
Biomedical Optics (1997) 2:364-374; and DeRisi J.L. et al. (1997) Science 278:680-686. 
Existing methods generally correct for background signal by subtracting either the local or 

25 global background. However, these methods do not involve surface-bound nucleic acid 

probes, and in some cases background estimates obtained from local or global sampling of 
nonprobe regions overestimate background in regions that contain probes. Background 
overestimation negatively impacts the lowest dose of the target that can be reliably 
detected by an array involving a nucleic acid hybridization assay. 

30 Therefore, there is a need for reliable methods for estimating background signal 

from probe-containing regions in hybridization arrays. 



-3- 



Attorney Docket No. 10981620-1 



Summary of the Invention 

The present invention provides an array, methods and kits using the array, for 
detecting a target nucleotide sequence in an analyte. In a general aspect, the invention 
provides a detection process or assay capable of detecting the presence and/or amount of a 
5 target nucleotide sequence in the analyte. Further, the invention provides an improved, 
efficient and cost-effective method for nucleic acid hybridization assays. 

Accordingly, in one embodiment, the invention provides a set of features 
comprising oligophosphodiester probes, wherein the features comprise hybridization 
features comprising hybridization probes that selectively hybridize to a detectably labeled 

10 target nucleotide sequence, and background features comprising background probes that do 
not selectively hybridize to the target nucleotide sequence, and further wherein the probes 
may be in solution or are bound to a surface. The target nucleotide sequence may be 
labeled with a detectable label prior to, or after hybridization, preferably prior to 
hybridization. In one embodiment, the target nucleotide sequence is directly labeled with a 

15 detectable label. In an alternative embodiment, the target nucleotide sequence is indirectly 
labeled with a detectable label prior to, or after hybridization, preferably prior to 
hybridization. In a preferred embodiment, the set of probes is bound to an array surface. 
In another preferred embodiment, the background probe is selected from the group 
consisting of empirically observed inactive probes, probes forming stable intramolecular 

20 structures, short probes, probes comprising reverse polarity nucleotide analogs and probes 
comprising abasic phosphodiesters or modified nucleotidic units. 

In an additional embodiment, the invention provides a method of detecting the 
presence and/or amount of a target nucleotide sequence in an analyte. The method of the 
invention comprises the following: 

25 (a) providing an analyte suspected of containing the target nucleotide sequence; 

(b) contacting an aliquot of the analyte suspected of containing said target 
nucleotide sequence with a set of features comprising oligophosphodiester probes, wherein 
the target nucleotide sequence is labeled with a detectable label capable of generating a 
measurable signal, and further wherein the features comprise: 

30 (i) hybridization features comprising hybridization probes that selectively 

hybridize to the target nucleotide sequence, and 
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(ii) background features comprising background probes that do not selectively 
hybridize to the target nucleotide sequence; 

(c) detecting an observed signal, wherein the observed signal is an amount of 
signal generated from contacting the target nucleotide sequence with said features 

5 comprising oligophosphodiester probes; 

(d) detecting a background signal, wherein the background signal is an amount 
of signal generated from the background features; and 

(e) subtracting the background signal from the observed signal to determine the 
presence and/or amount of the target nucleotide sequence in the analyte. 

10 In a preferred embodiment, the method involves the use of a set of probes bound to 

an array surface. In an additional preferred embodiment, the method involves the use of 
background probes that mimic nonspecific binding, while not preventing the binding of 
target to the hybridization probes. In more preferred embodiments, the background probe 
is selected from the group consisting of empirically observed inactive probes, probes 

15 forming stable intramolecular structures, short probes, probes comprising reverse polarity 
nucleotide analogs and probes comprising abasic phosphodiesters or modified nucleotidic 
units. 

In certain embodiments, the label is detected using colorimetric, fluorimetric, 
chemiluminescent or bioluminescent means. Thus, the label can be, for example, a 
20 fluorescent compound, i.e., capable of emitting radiation (visible or invisible) upon 

stimulation by radiation of a wavelength different from that of the emitted radiation, or 
through other manners of excitation, e.g. chemical or non-radiative energy transfer. The 
label may be a fluorescent dye associated with a nucleic acid. 

In another embodiment, the present invention is directed to a method for estimating 
25 background noise encountered in a nucleic acid hybridization assay. 

In another embodiment, the present invention is directed to a method of validating 
a test-background feature comprising test-background probes. The method of the 
invention comprises the following: 

(a) providing an analyte containing the target nucleotide sequence; 
30 (b) contacting an aliquot of the analyte containing the target nucleotide 

sequence with a set of features comprising oligophosphodiester probes, wherein the target 
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nucleotide sequence is labeled with a detectable label capable of generating a measurable 
signal, and further wherein the features comprise 

(i) hybridization features comprising hybridization probes that selectively 
hybridize to the target nucleotide sequence, 
5 (ii) test-background features comprising test-background probes that do not 

selectively hybridize to the target nucleotide sequence, and 
(iii) standard-background features comprising standard-background probes that do 
not selectively hybridize to the target nucleotide sequence; 

(c) detecting an observed signal, wherein the observed signal is an amount of 
10 signal generated from contacting the target nucleotide sequence with said features 

comprising oligophosphodiester probes; 

(d) detecting a test-background signal, wherein the test-background signal is an 
amount of signal generated from the test-background features; 

(e) detecting a standard-background signal, wherein the standard-background 
15 signal is an amount of signal generated from the standard-background features; and 

(f) comparing the amount of the test-background signal with the amount of the 
standard-background signal. 

Finally, the invention encompasses test kits for detecting the presence and/or 
amount of a target nucleotide sequence in an analyte. The kit comprises a container 
20 containing an array of features comprising oligophosphodiester probes, wherein the 

features comprise hybridization features comprising hybridization probes that selectively 
hybridize to a target nucleotide sequence, and background features comprising background 
probes that do not selectively hybridize to the target nucleotide sequence. 

These and other embodiments of the present invention will readily occur to those of 
25 ordinary skill in the art in view of the disclosure herein. 



Brief Description of the Drawings 

Figure 1 illustrates the results of a hybridization assay, wherein empirically 
observed background probes (tabulated in Table 1) and specific hybridization probes were 
30 hybridized to rhodamine 6-G (R6G)-labeled G3PDH cRNA (SEQ ID NO: 1). 
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Figure 2 is a magnified view of a portion of Figure 1, showing the signal of 
features containing the empirically observed background probes, and the signal from the 
local background outside the features. 

Figure 3 illustrates a region of a hybridized array, wherein the empirically observed 
5 background probes appear as dark features within brighter local background and the 
specific hybridization probes appear as bright features. The array contained features 
specific to five xenogenes, positive control features, and background features. The array 
was hybridized to R6G-labeled cRNA from five xenogenes as well as R6G-labeled 
complex human pool cRNA. 
10 Figure 4 illustrates the results of a hybridization assay, wherein the designed 

hairpin probes were hybridized to R6G-labeled yeast cRNA. 

Figure 5 illustrates the results of a hybridization assay, wherein the shortened 
G3PDH-570 probes (SEQ ID NO: 17, 29-32) were hybridized to R6G-labeled yeast cRNA. 

Figure 6 depicts various abasic phosphoramidite structures which can be used to 
15 produce background probes of the present invention. 

Figure 7 is a magnified view of a portion of Figure 3, showing the signal of 
features containing the empirically observed background features, specific hybridization 
features, and the local background outside the features. 

Figure 8 shows the signal from a region of the array seen in Figures 3, illustrating 
20 the signal from background features, hybridization features and local background. As 
observed in the figure, the signal from the background features is much less than the 
surrounding local background, yielding "negative features." 

Figure 9 demonstrates the inter-feature standard deviations (interfeature_SD) of 
signal from replicate features of hybridization probes, wherein the standard deviations are 
25 calculated with or without local background correction. The five arrays contained specific 
features to five xenogenes, positive control features, and background features. The arrays 
were hybridized to R6G-labeled cRNA from five xenogenes, ranging from 3 pM to 3000 
pM, as well as R6G-labeled human cRNA pool target. 

Figure 10 illustrates signal statistics of one probe, cor47-181 (SEQ ID NO:33) 
30 across the five arrays described in Figure 9. The global background and average 

background feature signals from each array are also shown. The background features 
accurately model the cor47-181 (SEQ ID NO: 33) features that have low signals, while the 
global background overestimates these background signals. 
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Figure 1 1 uses the non-background-corrected signal data from Figure 8 and 
compares the threshold limits of signal significance using three methods: background 
features, local background, and minimum global background. The figure illustrates the 
effect of background overestimation on data analysis: hybridization features that are 
5 significantly different from the threshold using background features are not significant 
when using either local or global background thresholds. 

Figure 12 uses the non-background-corrected signal data the pbp 1-203 probe (SEQ 
ED NO:35) from the five arrays described in Figure 9. The global background and average 
background feature thresholds from each array are also shown. The use background 
10 features, as opposed to global background, yields a much lower LLD, i.e. increased 
sensitivity. 

Figure 13 illustrates the effect of the method of background correction has on LLD 
calculations, wherein the use background features, as opposed to global background, yields 
a much lower LLD, i.e. increased sensitivity. The data was obtained from the five arrays 
15 described in Figure 9. 

Detailed Description 

The practice of the present invention will employ, unless otherwise indicated, 

conventional techniques of chemistry, biochemistry, molecular biology, and medicine, 
20 including diagnostics, which are within the skill of the art. Such techniques are explained 

fully in the literature. See, e.g., Solid-Phase Synthesis, Blossey, E. C. and Neckers, D. C. 

Eds. 1975; Sambrook, Fritsch & Maniatis, Molecular Cloning: A Laboratory Manual; 

DNA Cloning, Vols. I and II (D.N. Glover ed.); Oligonucleotide Synthesis (M.J. Gait ed.); 

Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds.); and the series, Methods In 
25 Enzymology (S. Colowick and N. Kaplan eds., Academic Press, Inc.); Beaucage and 

Carruthers, Tetrahedron Lett, 22:1859-1862 (1981); Matteucci, et al, /. Am. Chem. Soc, 

103:3185 (1981); Letsinger, R.L. and Mahadevan, V., J. Amer. Chem. Soc, 88:5319-5324. 
All patents, patent applications, and publications mentioned herein, whether supra 

or infra, are hereby incorporated by reference in their entirety. 

30 
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A. Definitions 

In describing the present invention, the following terms will be employed, and are 
intended to be defined as indicated below. 

It must be noted that, as used in this specification and the appended claims, the 
5 singular forms "a", "an", and "the" include plural referents unless the content clearly 

dictates otherwise. Thus, for example, reference to "a target analyte" includes a mixture of 
two or more such analytes, "a reagent" includes a mixture of two or more reagents, and the 
like. 

Unless defined otherwise, all technical and scientific terms used herein have the 

10 same meaning as commonly understood by one of ordinary skill in the art to which the 
invention pertains. The following terms are intended to be defined as indicated below. 

As used herein, the terms "hybridization," "hybridizing" and "binding" may be 
used interchangeably. The ability of two nucleotide sequences to hybridize with each other 
is based on the degree of complementarity of the two nucleotide sequences, which in turn 

15 is based on the fraction of matched complementary nucleotide pairs. The more nucleotides 
in a given sequence that are complementary to the nucleotides in another sequence, the 
more stringent the conditions can be for hybridization and the more specific will be the 
binding of the two sequences. Increased stringency is achieved by elevating the 
temperature, increasing the ratio of co-solvents, lowering the salt concentration, and the 

20 like. Hybridization processes and conditions are described by Sambrook, J. et al., 

(Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, 2 nd Ed., 1989, vol. 1-3). Conditions for hybridization typically include high 
ionic strength solution, controlled temperature, and the presence of carrier DNA and 
detergents and divalent cation chelators, all of which are well known in the art. 

25 As used herein, the term "specific hybridization" refers to those occurrences in 

which a segment of an oligonucleotide probe preferentially hybridizes with a segment of a 
selected polynucleotide, as intended. The use of the term "hybridizes" is not meant to 
exclude non Watson-Crick base pairing. 

As used herein, the term "nonspecific hybridization" refers to those occurrences in 

30 which a segment of an oligonucleotide probe does not preferentially hybridize to a segment 
of a selected, specific complementary first polynucleotide but also hybridizes to a second 
polynucleotide, triggering an erroneous result, i.e., giving rise to a situation where label 
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may be detected in the absence of a the specific complementary polynucleotide (e.g. 
nucleotide sequence or a target molecule). The use of the term "hybridizes" is not meant 
to exclude non Watson-Crick base pairing. 

As used herein, the term "nonspecific binding" is used to refer to those occurrences 
5 in which a polynucleotide binds to the solid support, or other assay component, through an 
interaction, which may be either direct or indirect, that does not involve hydrogen bonding 
to support-bound oligophosphodiesters. 

A "nucleoside" has two components: a nitrogenous base and a pentose sugar. A 
"nucleotide" has 3 components: a nitrogenous base, a pentose sugar and a phosphate. 
10 (See, Lehninger A.L., et al., "Principles of Biochemistry" 2nd Ed, Worth Publishers, 
(1993) p. 325). 

As used herein, the term "oligophosphodiester," refers to polymeric molecules, 
including oligonucleotides, polynucleotides, modified nucleotides, modified nucleotidic 
units and abasic phosphodiesters, as described in, e.g., Example 5, infra. 

15 As used herein, the terms "nucleic acid molecule," "oligonucleotide," "nucleotide 

sequence" and "polynucleotide" may be used interchangeably, and refer to nucleic acid 
molecules and polymers thereof, including conventional purine or pyrimidine bases as well 
as base analogs. Such molecules include without limitation nucleic acids, and fragments 
thereof, from any source in purified or unpurified form including DNA, double-stranded or 

20 single stranded (dsDNA and ssDNA), and RNA, including t-RNA, m-RNA, r-RNA, 
mitochondrial DNA and RNA, chloroplast DNA and RNA, DNA/RNA hybrids, or 
mixtures thereof, genes, chromosomes, plasmids, the genomes of biological materials such 
as microorganisms, e.g. bacteria, yeasts, viruses, viroids, molds, fungi, plants, animals, 
humans, and the like; polynucleotides containing an N- or a C-glycoside of a purine or 

25 pyrimidine base; other polymers containing nonnucleotidic backbones, for example, abasic 
phosphodiesters (as described in, e.g., Example 5, infra), polyamide (e.g., peptide nucleic 
acids (PNAs)) and polymorpholino (commercially available from the Anti-Virals, Inc., 
Corvallis, Oregon, as Neugene™ polymers), and other synthetic sequence-specific nucleic 
acid polymers providing that the polymers contain nucleobases in a configuration which 

30 allows for base pairing and base stacking, such as is found in DNA and RNA. The 

polynucleotide can be only a minor fraction of a complex mixture such as a biological 
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sample. Also included are genes, such as hemoglobin gene for sickle-cell anemia, cystic 
fibrosis gene, oncogenes, cDNA, and the like. 

The terms "polynucleotide" and "oligonucleotide," also include known types of 
modifications, for example, labels which are known in the art, methylation, "caps," 
5 substitution of one or more of the naturally occurring nucleotides with an analog, 

internucleotide modifications such as, for example, those with uncharged linkages (e.g., 
methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), with 
negatively charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), and with 
positively charged linkages (e.g., aminoalklyphosphoramidates, 

10 arninoalkylphosphotriesters), those containing pendant moieties, such as, for example, 

proteins (including nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), those 
with intercalators (e.g., acridine, psoralen, etc.), those containing chelators (e.g., metals, 
radioactive metals, boron, oxidative metals, etc.), those containing alkylators, those with 
modified linkages (e.g., alpha anomeric nucleic acids, etc.), as well as unmodified forms of 

15 the polynucleotide. 

Various techniques can be employed for preparing a polynucleotide. Such 
polynucleotides can be obtained by biological synthesis or by chemical synthesis. For 
short sequences (up to about 100 nucleotides), chemical synthesis is economical, provides 
a convenient way of incorporating low molecular weight compounds and/or modified 

20 bases during specific synthesis steps, and is very flexible in the choice of length and region 
of target polynucleotide binding sequence. Polynucleotides can be synthesized by standard 
methods such as those used in commercial automated nucleic acid synthesizers. Chemical 
synthesis of DNA on a suitably modified glass or resin can result in DNA covalently 
attached to the surface, potentially advantageous in washing and sample handling. For 

25 longer sequences standard replication methods employed in molecular biology can be used 
such as the use of M13 for single stranded DNA as described by Messing, J., Methods 
Enzymol., 1983, 101 :20-78: or the use of polymerase chain reaction as described in U.S. 
Patent Nos. 4,683,195; 4,683,202 and 4,965,188. 

Other methods of polynucleotide synthesis include phosphotriester and 

30 phosphodiester methods (Narang, S. A. et al., Meth. Enzymol, 1979, 68:90) and synthesis 
on a support (Beaucage, et al., Tetrahedron Letters, 1981, 22:1859-1862) as well as 
phosphoramidate techniques (Caruthers, M. H. , et al., Methods in Enzymology, 1988, 
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154:287-314) and others described in Synthesis and Applications ofDNA andRNA 
(Narang, S. A., editor, Academic Press, New York, 1987), and the references contained 
therein. Sequential addition of nucleotide phosphoramidites to surface-linked hydroxyl 
groups is described by T. Brown and Dorcas J. S. Brown in Oligonucleotides and 
5 Analogues A Practical Approach, F. Eckstein, editor, Oxford University Press, Oxford, pp 
1-24 (1991). The chemical synthesis via a photolithographic method of spatially 
addressable arrays of oligonucleotides bound to glass surfaces is described by Pease, A. C. 
et al., Proc. Nat. Aca. Sci, 1994, 91:5022-5026. Deposition of pre-synthesized 
oligonucleotides may be accomplished by (1) covalent linkage of a chemically modified 

10 oligonucleotide (e.g. aliphatic primary amine) to the substrate surface bearing an amine- 
reactive group (e.g. aromatic isothiocyanate) as described in Guo Z. et al., Nucleic Acids 
Res, 1994, 22:5456-65, or (2) adsorption to a substrate surface coated with a positively 
charged polyelectrolyte (e.g. poly-L- lysine), followed by cross-linking to the surface 
chemically or photochemically (e.g. covalent stabilization via ultraviolet (UV) photo- 

15 crosslinking), as described in Schena, M. et al., Science, 1995, 270:467-70. Common 
deposition equipment used for forming arrays includes that described in Schena, M. et 
al.(cited above); Pease, A. C. et al., Proc. Natl Acad. Sci, 1994, 91:5022-6 and Blanchard, 
A. P. et al., Biosensors & Bioelectronics, 1996, 11:687-690. 

For purposes of this invention, the polynucleotide, or a cleaved fragment obtained 

20 from the polynucleotide, will usually be at least partially denatured or single stranded or 

treated to render it denatured or single stranded. Such treatments are well known in the art 
and include, for instance, heat or alkali treatment, or enzymatic digestion of one strand. 
For example, double stranded DNA (dsDNA) can be heated at 90-100°C for a period of 
about 1 to 10 minutes to produce denatured material, while RNA produced via 

25 transcription from a ds-DNA template is already single stranded. A polynucleotide can 
have from about 5 to 5,000,000 or more nucleotides. The larger polynucleotides are 
generally found in the natural state. In an isolated state the polynucleotide can have about 
30 to 50,000 or more nucleotides, usually about 100 to 20,000 nucleotides, more 
frequently 500 to 10,000 nucleotides. 

30 As used herein, the term "modified nucleotide" refers to a naturally occurring or a 

synthetic unit in a nucleic acid polymer that contains modifications to the base, sugar 
and/or phosphate groups. The modified nucleotide can be produced by a chemical 
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modification of a nucleotide either as part of the nucleic acid polymer or prior to the 
incorporation of the modified nucleotide into the nucleic acid polymer. For example, the 
methods mentioned above for the synthesis of an oligonucleotide may be employed. In 
another approach a modified nucleotide can be produced by incorporating a modified 
5 nucleoside triphosphate into the polymer chain during an amplification reaction. 

Examples of modified nucleotides, by way of illustration and not limitation, include 
dideoxynucleotides, derivatives or analogs that are biotinylated, amine modified, alkylated, 
fluorophore-labeled, and the like and also include phosphorothioate, phosphite, ring atom 
modified derivatives, and so forth. 
10 Modified nucleosides or nucleotides will also include modifications on the sugar 

moiety, e.g., wherein one or more of the hydroxyl groups are replaced with halogen, 
aliphatic groups, or are functionalized as ethers, amines, or the like. Additionally, 
modified nucleotides will also include abasic phosphodiesters (as described in, e.g., 
Example 5, infra). 

15 The term "nucleotidic unit" is intended to encompass nucleosides, nucleotides and 

modified nucleotides. 

Furthermore, modifications to nucleotidic units include rearranging, appending, 
substituting for or otherwise altering functional groups on the purine and/or pyrimidine 
base which form hydrogen bonds to a respective complementary pyrimidine or purine. 

20 The resultant modified nucleotidic unit may form a base pair with other such modified 
nucleotidic units but not with A, T, C, G or U. Standard A-T and G-C base pairs form 
under conditions which allow the formation of hydrogen bonds between the N 3 -H and C 4 - 
oxy of thymidine and the N 1 and C 6 -NH respectively, of adenosine and between the C 2 - 
oxy, N 3 and C 4 -NH 2 , of cytidine and the C 2 -NH 2 , N ! -H and C 6 -oxy, respectively, of 

25 guanosine. Thus, for example, guanosine (2-amino-6-oxy-9-P-D-ribofuranosyl-purine) 

may be modified to form isoguanosine (2-oxy-6-amino-9-P-D-ribofuranosyl-purine). Such 
modification results in a nucleoside base which will no longer effectively form a standard 
base pair with cytosine. However, modification of cytosine (l-(3-D-ribofuranosyl-2-oxy-4- 
amino-pyrimidine) to form isocytosine (l-P-D-ribofuranosyl-2-amino-4-oxy-pyrimidine) 

30 results in a modified nucleotide which will not effectively base pair with guanosine but 
will form a base pair with isoguanosine. Isocytosine is available from Sigma Chemical 
Co. (St. Louis, Mo.); isocytidine, 2'-deoxy-5-methyl-isocytidine, isoguanine nucleotides 
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may be prepared by the method described in the art. See U.S. Patent No. 5,681,702. Other 
such modified nucleotidic units which form unique base pairs have been described in 
Piccirilli et al. (1990) Nature 343:33-37 and Leach et al. (1992) /. Am. Chem. Soc. 
114 :3675-3683. or will be apparent to those of ordinary skill in the art. 
5 As used herein, the term "abasic phosphodiester" refers to a polymer comprising 

DNA analogs formed from chemically modified precursors unable to form hydrogen 
bonds, or in which the nitrogenous bases are absent, or wherein the entire deoxyribose 
sugar-nitrogenous base moiety has been replaced by a polyether structure (for further 
details see, e.g., Example 5, infra). Such abasic phosphodiesters possess polyelectrolyte 

10 properties similar to normal DNA and yield a surface with similar nonspecific binding 
properties as surfaces bearing normal DNA strands. 

The term "analyte" refers to a sample derived from a variety of sources such as 
from food stuffs, environmental materials, a biological sample or solid, such as tissue or 
fluid isolated from an individual, including but not limited to, for example, plasma, serum, 

15 spinal fluid, semen, lymph fluid, the external sections of the skin, respiratory, intestinal, 
and genitourinary tracts, tears, saliva, milk, blood cells, tumors, organs, and also samples 
of in vitro cell culture constituents (including but not limited to conditioned medium 
resulting from the growth of cells in cell culture medium, putatively virally infected cells, 
recombinant cells, and cell components). The analyte may contain a single- or double- 

20 stranded nucleic acid molecule which includes a target nucleotide sequence and may be 
prepared for hybridization analysis by a variety of means, e.g., using proteinase K/SDS, 
chaotropic salts, or the like. 

As used herein, the terms "target region" or "target nucleotide sequence" may be 
used interchangeably, and refers to a sequence of nucleotides to be identified, usually 

25 existing within a portion or all of a polynucleotide, usually a polynucleotide analyte. The 
identity of the target nucleotide sequence generally is known to an extent sufficient to 
allow preparation of various probe sequences hybridizable with the target nucleotide 
sequence. The term "target sequence" refers to a sequence with which a probe will form a 
stable hybrid under desired conditions. The target sequence generally contains from about 

30 30 to 5,000 or more nucleotides, preferably about 50 to 1,000 nucleotides. The target 

nucleotide sequence is generally a fraction of a larger molecule or it may be substantially 
the entire molecule such as a polynucleotide as described above. The minimum number of 
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nucleotides in the target nucleotide sequence is selected to assure that the presence of a 
target polynucleotide in a sample is a specific indicator of the presence of polynucleotide 
in a sample. The maximum number of nucleotides in the target nucleotide sequence is 
normally governed by several factors: the length of the polynucleotide from which it is 
5 derived, the tendency of such polynucleotide to be broken by shearing or other processes 
during isolation, the efficiency of any procedures required to prepare the sample for 
analysis (e.g. transcription of a DNA template into RNA) and the efficiency of detection 
and/or amplification of the target nucleotide sequence, where appropriate. 

As used herein the term "xenogene" refers to non-mammalian genes, i.e. genes not 
10 derived from a mammalian genome, preferably a non-human genes. Xenogenes may be 
derived from any non-mammalian source, such as plants, yeasts, bacteria, virus, and the 
like. 

A "hybridization probe", also termed a "normal probe" or a "real probe" herein, 
refers to a structure comprised of an oligonucleotide, as defined above, which contains a 

15 nucleic acid sequence complementary to a nucleic acid sequence present in the analyte of 
interest such that the hybridization probe will specifically hybridize to the target nucleotide 
sequence under appropriate conditions. 

A "background probe" or a "negative control probe" is defined as a probe that 
closely mimics the nonspecific binding properties of hybridization or real probes, but 

20 which possesses no specific affinity for target nucleic acid sequences. In a preferred 
embodiment, the region of the array that contains background probe will locally and 
globally yield minimal signal levels. The length of the "hybridization" and "background" 
probes is generally from about 5 to about 50 nucleotidic units, more preferably from about 
10 to about 30 nucleotidic units, and even more preferably from about 10 to about 25 

25 nucleotidic units. Additionally, the "hybridization" and "background" probes need not be 
the same length. 

A "positive control probe" refers to a structure comprised of an oligonucleotide, as 
defined above, which contains a first nucleic acid sequence complementary to a second 
nucleic acid sequence of interest such that the positive control probe will specifically 
30 hybridize to the second nucleic acid sequence under appropriate conditions (for further 
details see, e.g., Examples 1, 2 and 6 infra). 
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As used herein, the term "feature" is defined as a set of plurality of probes, wherein 
the probes may be in solution or are bound to a surface. In preferred embodiments, the 
probes are bound to a surface, wherein each set of probes is arranged in a spaced-apart 
relation to each other at known locations. In more preferred embodiments, a feature is the 
5 region of the array that contain probes, the features are separated by regions devoid of 
probes, and each feature occurs at approximately known locations and is distinct from 
other features. The ratio of hybridization features to background features, as defined 
below, is 1 to 10,000, preferably 10 to 5,000, more preferably 50 to 2,000. 

A "hybridization feature" is defined as a structure comprised of a plurality of 

10 hybridization probes that selectively hybridize to a detectably labeled target nucleotide 

sequence, wherein the target may be labeled prior to or after hybridization, preferably prior 
to hybridization, as defined above. In a preferred embodiment, a hybridization feature 
contains 3.1 x 10 6 to 6.3 x 10 7 hybridization probes, preferably 1.6 x 10 7 to 4.7 x 10 7 , more 
preferably 2.8 x 10 7 to 3.5 x 10 7 hybridization probes. 

15 A "background feature" is defined as a structure comprised of a plurality of 

background probes that do not selectively hybridize to the target nucleotide sequence, as 
defined above. In a preferred embodiment, a background feature is a region of an array 
that contains background probes covalently bound to the array- surface. In a preferred 
embodiment, a background feature contains 3.1 x 10 6 to 6.3 x 10 7 background probes, 

20 preferably 1.6 x 10 7 to 4.7 x 10 7 , more preferably 2.8 x 10 7 to 3.5 x 10 7 background probes. 

As used herein, a "standard-background feature" refers to a background feature 
comprising background probes which have been standardized/validated against the 
hybridization features according to the methods of the instant invention. 

As used herein, a "test-background feature" refers to a background probe feature 

25 comprising non-standard background probes, i.e., new and/or unknown background probes 
that have not been standardized/validated. Test-background features are validated against 
the standard-background features and the hybridization features according to the methods 
of the instant invention. A test-background feature is validated if the signal from the test- 
background probe is as low as, or lower than signal from the standard-background. 

30 Additionally, the signal replicates between the test-background probe features of a given 
sequence should be as good as the signal replicates of the standard-background features. 
Monitoring the inter-feature % coefficient of variation (%CV) is an example of testing for 
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accuracy of replication. The inter-feature %CV is calculated by dividing the inter-feature 
standard deviation by the inter-feature mean, and multiplying by 100. 

Two nucleotide sequences are "complementary" to one another when those 
molecules share base pair organization homology. "Complementary" nucleotide 
5 sequences will combine with specificity to form a stable duplex under appropriate 

hybridization conditions. For instance, two sequences are complementary when a section 
of a first sequence can bind to a section of a second sequence in an anti-parallel sense 
wherein the 3 '-end of each sequence binds to the 5 '-end of the other sequence and each A, 
T(U), G, and C of one sequence is then aligned with a T(U), A, C, and G, respectively, of 

10 the other sequence. RNA sequences can also include complementary G=U or U=G base 
pairs. Thus, two sequences need not have perfect homology to be "complementary" under 
the invention, and in most situations two sequences are sufficiently complementary when 
at least about 85% (preferably at least about 90%, and most preferably at least about 95%) 
of the nucleotides share base pair organization over a defined length of the molecule. 

15 As used herein, the terms "reporter," "label" and "detectable label" refer to a 

molecule capable of detection, including, but not limited to, radioactive isotopes, 
fluorescers, chemiluminescers, enzymes, enzyme substrates, enzyme cofactors, enzyme 
inhibitors, dyes, metal ions, metal sols, ligands (e.g., biotin or haptens) and the like. The 
term "fluorescer" refers to a substance or a portion thereof which is capable of exhibiting 

20 fluorescence in the detectable range. The term "cofactor" is used broadly herein to include 
any molecular moiety which participates in an enzymatic reaction. Particular examples of 
labels which may be used under the invention include fluorescein, 5(6)- 
carboxyfluorescein, Cyanine 3 (Cy3), Cyanine 5 (Cy5), rhodamine, dansyl, umbelliferone, 
Texas red, luminol, NADPH, a,P-galactosidase and horseradish peroxidase. 

25 The term "substrate" is used interchangeably herein with the terms "support" and 

"solid substrate," and denotes any solid support suitable for immobilizing one or more 
nucleic acid molecules as discussed further below. 

As used herein, the term "array" is defined as a collection of separate probes each 
arranged in a spatially defined and a physically addressable manner. The number of probes 

30 or features that can be deposited on an array will largely be determined by the surface area 
of the substrate, the size of a feature and the spacing between features, wherein the array 
surface may or may not comprise a local background region represented by non-feature 
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area. Generally, arrays can have densities of up to several hundred thousand or more 
features per cm 2 , preferably about 2,500 to about 200,000 features/cm 2 . 

As used herein, the term "observed signal" is defined as the amount of signal 
generated from contacting the target nucleotide sequence with the features comprising 
5 oligophosphodiester probes, wherein the target nucleotide sequence may be labeled prior 
to or after hybridization, preferably prior to hybridization. The observed signal is a 
combination of the "real" or "hybridization" signal, i.e. the signal generated from the 
hybridization of the labeled target nucleotide sequence with the hybridization probes, and 
the background signal, as defined below. 

10 As used herein, the term "background signal" is defined as the amount of signal 

generated from the background features, and the signal generated due to an undesirable 
interaction of various components used in a given assay, i.e. signal generated by entities 
other than due to hybridization of the analyte and the specified complementary probes, 
such as signal generated from (i) the reporter, i.e. a signal arising from the label itself when 

15 it is not attached to the target, such as a signal generated from a fluorescent dye used in 
labeling the target; (ii) the non-reporter, i.e. a signal generated from the substrate or other 
assay components, and (iii) signal generated due to non-specific binding of probes to 
labeled entities other than their specific target molecules, i.e. binding not related to 
hybridization of the analyte and the complementary probes. 

20 As used herein, the term "local background signal" is defined as the signal 

generated from the portion of the array outside of the probe feature area, generally, the area 
immediately adjacent to a given feature. 

As used herein, the term "global background signal" is determined by calculating 
either the minimum or the average value of all the local background signals from an entire 

25 array. 

As used herein, the term "non-reporter signal" is defined as a signal generated from 
the substrate or other assay components, not from the reporter. 

As used herein, the term "Lowest Limit of Detection (LLD)" is defined as the 
lowest concentration of analyte that yields a signal which is statistically significantly 
30 greater than the background signal. Generally, the signal will be greater than the sum of 
the background signal and the noise of the background signal measurement. This sum of 
background signal and noise is referred to as the threshold. Thus, LLD is an important 
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analytic method performance parameter that is particularly sensitive to background 
measurement methods. Often, two standard deviations of the sampled background 
distribution (2a) is used to track noise, if the noise is normally distributed. Alternatively, a 
composite estimation of noise that tracks both observed signal noise and background 
5 signal noise is calculated. If the estimation of noise uses the sum of the background signal 
and two standard deviations as a threshold, the resulting net signal which is calculated will 
have a confidence of 95% of being different from the background. 

As used herein, the term "pixel signal variation" is defined as a method of 
calculating noise of a feature or noise of the local background. The signal from a feature 

10 or signal from a local background region is reported as a statistically relevant value (e.g., 
average or median) of all the individual pixel signals within that feature or local 
background region. The standard deviation of the pixel signals within a feature or local 
background region can be used to characterize the pixel statistics of that feature or region. 
The percent coefficient of variation (%CV) is calculated by dividing the intra-feature 

1 5 standard deviation by the intra-feature mean, and multiplying by 100. 

B. General Methods 

Before describing the present invention in detail, it is to be understood that this 

invention is not limited to particular formulations or process parameters as such may, of 
20 course, vary. It is also to be understood that the terminology and examples used herein are 

for the purpose of describing particular embodiments of the invention only, and are not 

intended to be limiting. 

Although a number of compositions and methods similar or equivalent to those 

described herein can be used in the practice of the present invention, the preferred 
25 materials and methods are described. 

The present invention is based on the discovery of an accurate and reliable method 

for reducing background effects in nucleic acid hybridization assays. In particular, the 

method utilizes an array of oligophosphodiester probes wherein the array includes 

hybridization probes that selectively hybridize to a target nucleotide sequence, and 
30 background probes that do not selectively hybridize to the target nucleotide sequence. The 

probes are reacted with an analyte suspected of including the target nucleotide sequence, 

wherein the target nucleotide sequence is labeled, and further wherein the target nucleotide 
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sequence may be labeled prior to or after hybridization, preferably prior to hybridization. 
The resulting hybridization mixture is then analyzed to detect (i) the observed signal, i.e. 
the amount of signal generated from contacting the target nucleotide sequence with the 
features comprising oligophosphodiester probes, and (ii) the amount of background signal 
5 generated from a variety of sources of background signal, including non-specific binding 
of the labeled target nucleotide sequence to the background probes. Finally, the presence 
and/or amount of the target nucleotide sequence in the analyte is determined by subtracting 
the background signal from the observed signal of the labeled target nucleotide sequence 
with the hybridization probes. 

10 Probes for use in the subject invention include "hybridization" or "normal" or 

"real" probes; "background" or "negative control" probes and "positive control" probes. A 
hybridization probe or a real probe that binds to a target molecule is preferably one which 
binds to that target with high specificity. Preferably, the probe is covalently attached to the 
array surface. A probe-nucleic acid molecule will be specific for a nucleic acid target 

15 molecule with a base sequence complementary to the probe nucleic acid molecule. The 
probe-nucleic acid molecule hybridizes to the target molecule. 

As explained above, a variety of nucleic acid molecules can be used to form the 
hybridization probes. See, generally, Wetmur, J. (199 1) Crit Rev Biochem and Mol Bio 
26:227. In particular, modified oligonucleotides can be used to increase selectivity and 

20 sensitivity of the probes. Such modified oligonucleotides are well known in the art and 
described in e.g., Chollet et al. (1988) Nucleic Acids Res 16:305; Potapov et al. (1996) 
Pure &Appl. Chem 68: 1315; Soloman et al. (1993) J Org Chem 58:2232; Prosnyak et al. 
(1994) Genomics 21:490; Lin et al. (1991) Nucleosides & Nucleotides 10:675. For 
example, substitution of 2-aminoadenine for adenine, or substitution of 5-methylycytocine 

25 for cytosine can increase duplex stability. Prosnyak et al, supra. In addition, 

oligonucleotide probes containing both types of modified bases have increased duplex 
stability relative to unmodified analogs. Furthermore, substitution of 2-aminoadenine (2- 
AA) for adenine creates an additional hydrogen bond in the Watson-Crick base pair 
(Chollet et al., supra), and oligonucleotide probes containing 2- A A show increased 

30 selectivity and hybridization to target DNA. In this regard, 2-AA is used only as a 

substitute for adenine, and binds in a manner similar to the natural base. Other examples 
of modified oligonucleotides include the use of a base pair wherein a modified pyridone or 
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quinolone base pairs with 2-aminopurine (Solomon et al., supra), and the use of 
deoxycitidine derivatives in triplex formation (Huang et al. (1996) Nucleic Acids Res. 
14:2606). 

Hybridization probes for use with the present methods may be assembled using a 

5 combination of solid phase direct oligophosphodiester synthesis, enzymatic ligation 

methods, and solution phase chemical synthesis. Various methods for synthesizing probes 
are well known in the art. For example, it is a matter of routine to synthesize desired 
nucleic acid probes using conventional nucleotide phosphoramidite chemistry and 
instruments available from, e.g., Applied Biosystems, Inc., (Foster City, CA), Dupont 

10 (Wilmington, DE), or Milligen (Bedford MA). Thus, all chemical syntheses of 

oligophosphodiesters can be performed on an automatic DNA synthesizer, such as a Perkin 
Elmer/ Applied Biosystems Division model 380 B. For example, phosphoramidite 
chemistry of the P-cyanoethyl type can be used including 5 '-phosphorylation which 
employs PHOSTEL™ reagent (DMT-0-CH 2 -CH 2 -(S0 2 )-CH 2 -CH 2 -0-P(N(iPr) 2 )(-0-CH 2 

15 CH 2 CN) wherein DMT is dimethoxytrityl and iPr is isopropyl. 

Hybridization probes can be provided that hybridize with a variety of nucleic acid 
targets, such as viral, prokaryotic, and eukaryotic targets. The target may be a DNA target 
such as a gene (e.g., oncogene), control element (e.g., promoter, repressor, or enhancer), or 
sequence coding for ribosomal RNA, transfer RNA, mRNA, or RNase P. The target may 

20 be a viral genome or complementary copy thereof. Additionally, the target may be a 

"nucleic acid amplification product," e.g., a nucleic acid molecule, either DNA or RNA, 
resulting from the introduction of an enzyme or enzymes into the cell, wherein such 
enzymes make a nucleic acid molecule complementary to one already present in the cell. 
See, e.g, O. Bagasra et al. (1992) The New England Journal of Medicine 326:1385-1391. 

25 A background probe or a negative control probe, unlike the hybridization probe, is 

a probe that binds to target molecules only minimally and is preferably one which closely 
mimics the nonspecific binding properties and the non-reporter signal generation 
properties of the hybridization probe. In the case of surface-bound probes, the surface of 
background probe features also preferably mimic the surface of hybridization features as to 

30 their non-reporter signal generation and their non-specific binding of reporter and target. 
Background probes possess no specific affinity for target nucleic acid sequences. 
Preferably, a background probe is covalently attached to the array surface (background 
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probe feature). Background probes of the invention include, but are not limited, to 
empirically observed probes; probes that form stable intramolecular structures, such as 
hairpins and pseudo-half knots; short probes; short probes; probes comprising reverse 
polarity nucleotide analogs; probes comprising abasic phosphodiesters or modified 
5 nucleotidic units, and the like. 

Examples of empirically observed inactive probes are shown in Table 1, infra. In 
particular, these probes have been observed to bind their complementary targets very 
minimally, yielding minimal signal levels in hybridization assays and as such are useful as 
background probes in the methods of the subject invention. The probes shown in Table 1 
10 are from sequences originally designed to bind human G3PDH (SEQ ID NO: 1) and P53 
(SEQ ID NO: 4) targets. When the probes were allowed to hybridize to their 
complementary specific targets, very poor binding was observed. Subsequently, other 
purified targets, as well as complex pool RNA, were also observed to bind very poorly to 
these probes. 

1 5 Examples of probes including intramolecular hairpin structures are shown in Table 

2, infra. These probes possess nonspecific binding properties similar to those of 
hybridization probes. Such probes are single-stranded oligonucleotides which include 
regions of internal complementarity such that double-stranded loops are formed by base- 
pairing between sequences in the strand which are complementary. In a preferred 

20 embodiment, such probes are single-stranded oligonucleotides which include regions of 
internal complementarity such that double-stranded loops are formed by base-pairing 
between sequences in the strand which are complementary and opposite in polarity. 

Examples of short probes are shown in Table 5, infra (for further details, see, e.g., 
Example 4, infra). Other useful background probes include those that comprise reverse 

25 polarity nucleotide analogs, i.e. probes wherein the deoxyribose sugar-nitrogenous base 
backbone comprises nucleotides attached such that they are in opposite polarity as 
compared to the adjacent nucleotides. These nucleotides are capable of forming 
oligonucleotides with alternating {3'->-3'} and {5'->-5'} phosphodiester linkages, instead 
of the naturally occurring {3'->-5' } phosphodiester linkages. (See, e.g., U.S. Patent Nos. 

30 5,399,676; 5,527,899 and 5,721,218 and Koga, M. et al. (1991) J. Org. Chem. 56:3757- 
3759). Generally, the probe comprises about 1 to about 50 reverse polarity nucleotide 
analogs, more preferably about 2 to about 25 reverse polarity nucleotide analogs, and even 
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more preferably about 5 to about 10 reverse polarity nucleotide analogs. Such probes are 
synthesized using techniques well known in the art. (See, e.g., Koga, M. et al.). 

In another embodiment, the background probe comprises abasic phosphodiesters or 
modified nucleotidic units, which minimize nonspecific hybridization and nonspecific 
5 binding. Examples of abasic phosphodiesters include, but are not limited to, analogs of 
modified DNA, wherein the substitutents are replaced, or wherein the deoxyribose sugar- 
phosphodiester backbone with nitrogenous bases is substituted with a sugar- 
phosphodiester backbone without nitrogenous bases, or a backbone comprising polyether, 
and the like. Examples of phosphoramidite monomers used to synthesize an abasic 

10 phosphodiester, using standard chemical synthetic techniques, are illustrated in Figure 6 

and are commercially available (Glen Research, Sterling, VA). For example, in structure I, 
the nitrogenous base in the deoxyribose sugar ring is replaced with hydrogen. In structure 
n, the deoxyribose sugar ring is replaced by a triethylene glycol unit. Analogs of Structure 
II are obtained by replacing triethylene glycol with monoethyleneglycol and 

15 hexaethyleneglycol. Compounds of Structures I and II are commercially available (Glen 
Research, Sterling, VA). Preferably, these oligophosphodiesters are prepared using 
standard phosphoramidite-based synthetic methods. 

Various methods for forming arrays from pre-formed probes described above, or 
methods for generating the array using synthesis techniques to produce the probes in situ, 

20 are generally known in the art. See, for example, Southern, U.S. Patent No. 5,700,637; 
Pirrung, et al., U.S. Patent No. 5,143,854; PCT International Publication No. WO 
92/10092; and, Fodor, et al. (1991) Science 251:767-777. 

For example, probes can either be synthesized directly on the solid support or 
substrate to be used in the hybridization reaction or attached to the substrate after they are 

25 made. A variety of solid supports or substrates may be used to practice the method of the 
invention. In a preferred embodiment the substrate comprises a porous or non-porous 
water insoluble material. The substrate may be selected from a wide variety of materials 
including, but not limited to, inorganic powders such as silica, magnesium sulfate, and 
alumina; natural polymeric materials, particularly cellulosic materials and materials 

30 derived from cellulose, such as fiber containing papers, e.g., filter paper, chromatographic 
paper, etc.; synthetic or modified naturally occurring polymers, such as nitrocellulose, 
cellulose acetate, poly (vinyl chloride), polyamides, polyacrylamide, polyacrylate, 
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polymethacrylate, polyesters, polyolefins, polyethylene, polytetrafluoro-ethylene, 
polypropylene, poly (4-methylbutene), polystyrene, poly(ethylene terephthalate), nylon, 
poly( vinyl butyrate), cross linked dextran, agarose, etc.; either used by themselves or in 
conjunction with other materials; fused silica (e.g., glass), bioglass, silicon chips, ceramics, 
5 metals, and the like. For example, substrates include polystyrene, to which short 

oligophosphodiesters, e.g., oligonucleotides ranging from about 5 to about 50 nucleotides 
in length, may readily be covalently attached (Letsinger et al. (1975) Nucl. Acids Res. 
2:773-786), as well as polyacrylamide (Gait et al. (1982) Nucl. Acids Res. 10:6243-6254), 
silica (Caruthers et al. (1980) Tetrahedron Letters 21:719-722), and controlled-pore glass 

10 (Sproat et al. (1983) Tetrahedron Letters 24:5771-5774). Natural or synthetic assemblies 
such as liposomes, phospholipid vesicles, and cells can also be employed as substrates. 
Additionally, the substrate can be hydrophilic or capable of being rendered hydrophilic. 

Suitable substrates may exist, for example, as gels, sheets, tubing, spheres, 
containers, pads, slices, films, plates, slides, strips, plates, disks, rods, particles, beads, etc. 

15 The substrate is preferably flat, but may take on alternative surface configurations. The 
substrate can be a flat glass substrate, such as a conventional microscope glass slide, a 
cover slip and the like. Common substrates used for the arrays of probes are surface- 
derivatized glass or silica, or polymer membrane surfaces, as described in Guo, Z. et al. 
(cited above) and Maskos, U. et al., Nucleic Acids Res, 1992, 20: 1679-84 and Southern, E. 

20 M. et al., Nucleic acids Res, 1994, 22: 1368-73. 

Immobilization of the probe to a suitable substrate may be performed using 
conventional techniques. See, e.g., Letsinger et al. (1975) Nucl. Acids Res. 2:773-786; 
Pease, A.C. et al., Proc. Nat. Acad. Sci. USA, 1994, 91:5022-5026. and "Oligonucleotide 
Synthesis, a Practical Approach," Gait, MJ. (ed.), Oxford, England: IRL Press (1984). 

25 The surface of a substrate may be treated with an organosilane coupling agent to 

functionalize the surface. One exemplary organosilane coupling agent is represented by 
the formula R n SiY (4 _ n) wherein: Y represents a hydrolyzable group, e.g., alkoxy, typically 
lower alkoxy, acyloxy, lower acyloxy, amine, halogen, typically chlorine, or the like; R 
represents a nonhydrolyzable organic radical that possesses a functionality which enables 

30 the coupling agent to bond with organic resins and polymers; and n is 1, 2 or 3, usually 1. 
One example of such an organosilane coupling agent is 3-glycidoxypropyltrimethoxysilane 
("GOPS"), the coupling chemistry of which is well-known in the art. See, e.g., Arkins, 
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"Silane Coupling Agent Chemistry," Petrarch Systems Register and Review, Eds. 
Anderson et al. (1987). Other examples of organosilane coupling agents are (y- 
aminopropyl)triethoxysilane and (y- aminopropyl)trimethoxysilane. Still other suitable 
coupling agents are well known to those skilled in the art. Thus, once the organosilane 
5 coupling agent has been covalently attached to the support surface, the agent may be 

derivatized, if necessary, to provide for surface functional groups. In this manner, support 
surfaces may be coated with functional groups such as amino, carboxyl, hydroxyl, epoxy, 
aldehyde and the like. 

Use of the above functionalized coatings on a solid support provides a means for 

10 selectively attaching oligophosphodiesters to the support. Thus, an oligonucleotide probe 
formed as described above may be provided with a 5'-terminal amino group which can be 
reacted to form an amide bond with a surface carboxyl using carbodiimide coupling 
agents. 5' attachment of the oligonucleotide may also be effected using surface hydroxyl 
groups activated with cyanogen bromide to react with 5'-terminal amino groups. 3'- 

15 terminal attachment of an oligonucleotide probe may be effected using, for example, a 
hydroxyl or protected hydroxyl surface functionality. 

An array for use with the present invention will generally be constructed such that 
the ratio of hybridization features to background features is between about 1 to about 
10,000 , preferably between about 10 to about 5,000 , more preferably between about 50 to 

20 about 2,000. 

The analyte suspected of containing the target nucleotide sequence is labeled with 
detectable labels, prior to or after hybridization, preferably prior to hybridization, using 
techniques known in the art. In some embodiments, a target nucleotide sequence includes 
a label associated with a nucleic acid molecule capable of binding specifically to the probe. 

25 In one embodiment, the target nucleotide sequence is directly labeled with a detectable 
label, wherein the label may be covalently or non-covalently attached to the target 
nucleotide sequence. For example, the target nucleotide sequence may be labeled with 
biotin, exposed to hybridization conditions, wherein the labeled target nucleotide sequence 
binds to an avi din-label or an avidin-generating species. (Also see Example 1, infra). In 

30 an alternative embodiment, the target nucleotide sequence is indirectly labeled with a 
detectable label, wherein the label may be covalently or non-covalently attached to the 
target nucleotide sequence. For example, the label may be non-covalently attached to a 
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linker group, which in turn is (i) covalently attached to the target nucleotide sequence, or 
(ii) comprises a sequence which is complementary to the target nucleotide sequence. In 
another example, the probes may be extended, after hybridization, using chain-extension 
technology or sandwich-assay technology to generate a detectable signal (see, e.g., U.S. 
5 Patent No. 5,200,314). Generally, such detectable labels include, but are not limited to, 
radioactive isotopes, fluorescers, chemiluminescers, enzymes, enzyme substrates, enzyme 
cofactors, enzyme inhibitors, dyes, metal ions, metal sols, ligands (e.g., biotin or haptens) 
and the like. 

In one embodiment, the label is a fluorescent compound, i.e., capable of emitting 

10 radiation (visible or invisible) upon stimulation by radiation of a wavelength different from 
that of the emitted radiation, or through other manners of excitation, e.g. chemical or non- 
radiative energy transfer. The label may be a fluorescent dye. Preferably, a target with a 
fluorescent label includes a fluorescent group covalently attached to a nucleic acid 
molecule capable of binding specifically to the complementary probe nucleotide sequence. 

15 Fluorescent groups useful as labels in this invention include, but are not limited to, 
fluorescein (or FITC), Texas Red, coumarin, rhodamine, rhodamine derivatives, 
phycoerythrin, Perci-P, 4-methylumbelliferyl phosphate, resorufin, 7-diethylamino 
coumarin-3- carboxylic acid succinimidyl ester, and the like. Fluorescent groups having 
near infrared fluorescence include, but are not limited to, indocyanine green [CAS 

20 3599-32-4], copper phthalocyanine [CAS 147-14-8], 3, 3 '-diethyl- 19,1 1:15,17- 
dienopentylene-2,2'-thiapentacarbocyanine, and the like. 

Additionally, the label may be an aromatic compound (having one or more benzene 
or heteroaromatic rings or polycyclic aromatic or heteroaromatic structures). Labels for 
use in the present invention may also include chemiluminescent groups such as, but are not 

25 limited to, isoluminol (4-aminophthalhydrazide), and the like. In an additional 

embodiment, the label is a protein or an enzyme. In a preferred embodiment, the enzyme 
is capable of catalyzing a reaction that produces a detectably labeled product. 

Methods for attaching labels to target nucleotide sequence are similar to the 
methods for attaching labels to probes which are well known in the art. Enzo Biochemical 

30 (New York, NY), Clontech Laboratories, Inc. (Palo Alto, CA) and Ambion, Inc. (Austin, 
TX) (see Example 1, infra) have described and commercialized polynucleotide-labeling 
techniques. (See e.g., U.S. Patent Nos. 5,260,433; 5,241,060; 4,994,373; 5,401,837 and 
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5,141,183). For example, a primary amine can be attached to a 3' oligo terminus or a 5' 
oligo terminus. The amines can be reacted to various haptens using conventional 
activation and linking chemistries. International Publication Nos. WO 92/10505 and WO 
92/1 1388 teach methods for labeling polynucleotides at their 5' and 3' ends, respectively. 
5 According to one known method for labeling an oligonucleotide, a label-phosphoramidite 
reagent is prepared and used to add the label to the oligonucleotide during its synthesis. 
See, for example, N.T. Thuong et al. (1988) Tet. Letters 29:5905-5908. Preferably, target 
polynucleotides are labeled multiple times by inclusion of labeled nucleotides during target 
oligonucleotide synthesis. 

10 Once an array has been formed with the hybridization and background probes, it is 

contacted with an analyte suspected of containing a target nucleotide sequence and 
incubated under suitable hybridization conditions. Hybridization generally takes from 
about 30 minutes to about 24 hours, and occurs at the highest specificity approximately 10- 
25°C below the temperature (T m ) at which the nucleotide hybrid is 50% melted. The T m 

15 for a particular hybridization pair will vary with the length and nature of the nucleotides 
and may be readily determined by those of ordinary skill in the art. 

Generally, a nucleic acid molecule is capable of hybridizing selectively or 
specifically to a target sequence under moderately stringent hybridization conditions. In 
the context of the present invention, moderately stringent hybridization conditions 

20 generally allow detection of a target nucleic acid sequence of at least 14 nucleotides in 
length having at least approximately 70% sequence identity with the sequence of the 
selected nucleic acid probe. In another embodiment, such selective hybridization is 
performed under stringent hybridization conditions. Stringent hybridization conditions 
allow detection of target nucleic acid sequences of at least 14 nucleotides in length having 

25 a sequence identity of greater than 90% with the sequence of the selected nucleic acid 

probe. Hybridization conditions useful for probe/target hybridization where the probe and 
target have a specific degree of sequence identity, can be determined as is known in the art 
(see, for example, Nucleic Acid Hybridization: A Practical Approach, editors B.D. Hames 
and S.J. Higgins, (1985) Oxford; Washington, DC; IRL Press). 

30 With respect to stringency conditions for hybridization, it is well known in the art 

that numerous equivalent conditions can be employed to establish a particular stringency 
by varying, for example, the following factors: the length and nature of probe and target 
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sequences, base composition of the various sequences, concentrations of salts and other 
hybridization solution components, the presence or absence of blocking agents in the 
hybridization solutions (e.g., formamide, dextran sulfate, and polyethylene glycol), 
hybridization reaction temperature and time parameters, as well as, varying wash 
5 conditions. The selection of a particular set of hybridization conditions is well within the 
skill of a person of ordinary skill in the art (see, for example, Sambrook, et al., Molecular 
Cloning: A Laboratory Manual, Second Edition, (1989) Cold Spring Harbor, N.Y.). 

In general, hybridization is carried out in a buffered aqueous medium typically 
formulated with a salt buffer, detergents, nuclease inhibitors and chelating agents, using 

10 techniques well-known to those skilled in the art. Such formulations may be selected to 
preclude significant nonspecific binding of nucleotides with the support-bound array. 
Various solvents may be added to the medium such as formamide, dimethylformamide and 
dimethylsulfoxide, and the stringency of the hybridization medium may be controlled by 
temperature, pH, salt concentration, solvent system, or the like. Defining appropriate 

15 hybridization conditions is within the skill of the art. See, e.g., Sambrook, Fritsch & 
Maniatis, Molecular Cloning: A Laboratory Manual, Second Edition (1989). 

Following hybridization, the label is detected using colorimetric, fluorimetric, 
chemiluminescent or bioluminescent means. Fluorescent labels are detected by allowing 
the fluorescent molecule to absorb a energy and then emit some of the absorbed energy; 

20 the emitted energy is then detected using fluorimetric means. Preferably, the fluorescent 
dye is excitable by inexpensive commercially available lasers (e.g. HeNe, Micro Green, or 
solid state), has a quantum yield greater than 10%, exhibits low photo-bleaching and can 
be easily incorporated into target. In a preferred embodiment, when the target is labeled 
with R6G (Rhodamine-6-G), the label is detected by exciting at about 480nm to about 550 

25 nm, preferably at about 524 nm, and measuring light emitted at wavelengths at about 530 
nm to about 610 nm, preferably at about 557 nm. Generally, reasonable precautions are 
taken to minimize the concentration of species that absorb the excitation energy and emit 
in the detection range. 

Chemiluminescent label groups are detected by allowing them to enter into a 

30 reaction, e.g., an enzymatic reaction, that results in the emission of energy in the form of 
light. Other labels, e.g. biotin, may be detected because they can bind to groups such as 
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streptavidin which are bound, directly or indirectly to enzymes, e.g. (alkaline phosphatase 
or horseradish peroxidase) that can catalyze a detectable reaction. 

The methods described herein are useful in conjunction with any number of assay 
formats, such as in situ hybridization assays, Southern blotting, Northern blotting, dot blots 
5 and PCR reactions, including assays wherein the probes may be in solution or are bound to 
a surface. 

The present invention also provides test kits with one or more containers such as 
vials or bottles, with each container containing a separate component for the assay, such as 
an array, and reagents for carrying out nucleic acid hybridization assays according to the 

10 invention. Thus, the kit will comprise in packaged combination, an array of 

oligophosphodiester probes capable of generating a measurable signal when hybridized or 
bound to target, wherein the array comprises hybridization probes that selectively 
hybridize to the detectably labeled target nucleotide sequence, and background probes that 
do not selectively hybridize to the target nucleotide sequence. The kit may also include a 

15 denaturation reagent for denaturing the analyte, hybridization buffers, wash solutions, 

enzyme substrates, negative and positive controls and written instructions for carrying out 
the assay. 

EXAMPLES 

20 The following examples are illustrative in nature, and are not intended to limit the 

scope of the present invention in any manner. 

Example 1 
Empirically observed inactive probes 
25 In the process of trying to discover and validate probes for arrays that are sensitive 

to finding their targets, several probes that do not hybridize against their specific target 
were discovered. This example used in situ oligonucleotide probe arrays and R6G-labeled 
cRNA of the G3PDH gene (human glyceraldehyde-3-phosphate dehydrogenase gene), 
wherein the target polynucleotide (SEQ ID NO: 1) is the Watson-Crick complement of the 
30 mRNA. The probes in each feature used the sense-strand sequence of the G3PDH gene as 
input. The design produced 25-mer probes that were spaced at 10 base intervals along the 
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sequence, generating an array of 200 features which were regularly spaced across the entire 
G3PDH sequence. 

The oligonucleotide probes can be synthesized in situ, using standard methods of 
sequential phosphoramidite addition. Alternatively, the oligonucleotide probes can be 
5 synthesized by conventional chemosynthetic techniques (e.g. phosphoramidite chemistry) 
or by biosynthetic techniques (e.g. polymerase chain reaction "PCR"), printed onto the 
array surface, and covalently linked to that surface. See Brown T. et al., Oligonucleotides 
and Analogues A Practical Approach, and Schena M. et al., Science. The probe 
ingredients are printed or added to the feature locations of the substrate surface preferably 

10 using a modified thermal or piezoelectric inkjet-printing device. Pre-synthesized probes or 
probe compositions are printed, using the preferred printing device, on any of the array 
substrates mentioned previously, and preferably, are immobilized on the substrate using a 
poly-L-lysine coated substrate surface. For the purposes of the invention, the array of 
probes may be fabricated using conventional photolithography techniques as well. See, for 

15 example, Pease A. C. et al., Proc. Natl. Acad. Sci. USA. 

R6G-labeling of G3PDH cRNA (SEQ ID NO: 1) was accomplished by the 
following method. DNA templates containing T-7 RNA polymerase promoter sites were 
transcribed into RNA using Ambion's MAXIscript In Vitro Transcription Kit (Ambion, 
Inc., Austin, TX). Further, using the protocol for the addition of labeled nucleotide in 

20 RNA transcription reactions as described inAmbion's MAXIscript kit, R6G labeled CTP 

nucleotide was incorporated into RNA (Rhodamine-6G-CTP, CAS # 989-38-8, from NEN 
Life Sciences; Boston, MA). 

Hybridization conditions were as follows. The buffer consisted of 6x SSPE 
(sodium chloride/sodium phosphate/ethylenediamine tetra-acetic acid (EDTA)), 0.005% 

25 Triton X- 1 00, 0. 1 % w/v SDS (sodium dodecyl sulfate), 0. 1 % w/v BS A (bovine serum 
albumin, fraction V), 100 ug/ml hsDNA (heat-denatured herring sperm DNA). SSPE 
buffer components are described by Sambrook J. et al, in Molecular Cloning: A 
Laboratory Manual (Vol. 3, p. B 13; 2 nd Ed., 1989; Cold Spring :Harbor Laboratory Press). 
The target, R6G-labeled G3PDH cRNA (SEQ ID NO: 1), was present at 1 nM. 

30 The hybridization solution also included the positive control oligonucleotide, 

TAR25C, at 100 pM. TAR25C was used as the 5'-Cy3 (Cy3-TAR25C) (SEQ ID NO: 3 ). 
The positive control oligonucleotide hybridizes with the positive control probe, PR025G 



-30- 



Attorney Docket No. 10981620-1 



(SEQ ID NO: 2), on the array surface. The labeled TAR25C oligonucleotides were 
prepared and HPLC-purified by Operon Technologies, Inc. (Alameda, CA). The PR025G 
(SEQ ID NO: 2) and 5'-Cy3-TAR25C (SEQ ID NO: 3) sequences are: 
SEQ ID NO: 2 

5 5 ' - ATC ATCGT AGCTGGTCAGTGTATCC-3 ' 

SEQ ID NO: 3 

5 ' -Cy 3 -GG AT AC ACTGACC AGCT ACG ATG AT-3 ' 
The target solution (400 ul) was allowed to hybridize with the array in a 
hybridization apparatus overnight (18 to 20 hours) at 37°C with mixing on a rotary table. 
10 Arrays were washed by first removing the hybridization target/buffer solution, 

flushing the hybridization chamber with 6X SSPE/0.005% TX100, disassembling the 
hybridization chamber, and washing the array in a beaker of O.lx SSPE/0.005% TX100 at 
room temperature for 15 minutes with mixing. 

The scanning equipment typically used for the evaluation of hybridized arrays 
15 includes a scanning fluorometer and is commercially available from different sources, such 
as Molecular Dynamics of Sunnyvale, CA, General Scanning of Watertown, MA, Hewlett 
Packard of Palo Alto, California and Hitachi USA of South San Francisco, CA. Analysis 
of the data, (i.e., collection, reconstruction of image, comparison and interpretation of 
data) is performed with associated computer systems and commercially available software, 
20 such as EVIAGEQUANT™ by Molecular Dynamics (Sunnyvale, CA) or GENECHIP™ by 
Affymetrix (Santa Clara, California). 

The light source, typically from a laser, generates a collimated beam. The 
collimated beam sequentially illuminates small surface regions of known location. The 
resulting fluorescence photons from the surface regions are collected either confocally or 
25 non-confocally. The collected photons are transmitted through appropriate spectral filters, 
to an optical detector. A recording device, such as a computer memory, records electronic 
signals from the detector and builds up a raster scan file of intensities as a function of 
position, or time as it relates to the position. Such intensities, as a function of position, are 
referred to as "pixels". The pixels within a region centered upon the expected or intended 
30 position of a feature can be averaged to yield the relative quantity of target hybridized to 
the probe in that feature, if the expected or intended position of the feature is sufficiently 
close to its true position. For a discussion of the optical scanning equipment, see e.g., U. 
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S. Patent No. 5,760,951 (confocal scanner) and U. S. Patent No. 5,585,639 (off axis 
scanner). 

Figure 1 illustrates the results of a hybridization assay, wherein arrays containing 
probes designed to hybridize with G3PDH cRNA were hybridized to R6G-labeled G3PDH 
5 cRNA (SEQ ID NO: 1). The features that yielded minimal signal (labeled as Background 
Features in Figures 1 and 2) were tested on multiple arrays for their ability to hybridize to 
their specific labeled G3PDH cRNA target and in all cases were found to yield minimal 
signal. These empirically observed background probes are shown in Table 1. 
Additionally, probes that were designed to hybridize to cRNA of a portion of the P53 gene 
10 (human tumor suppressor p53 gene; target polynucleotide (SEQ ID NO: 4) is the Watson- 
Crick complement of the mRNA) and those found to yield minimal signal with Re- 
labeled P53 cRNA (SEQ ID NO:4) are also shown in Table 1. 
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Sequence, 5' to 3' 


CAGAGGAAGAGAATCTCCGCAAGAA 


GAATCTCCGCAAGAAAGGGGAGCCT 


CGAGCTGCCCCCAGGGAGCACTAAG 


CCAGGGAGCACTAAGCGAGCACTGC 


TGAATGAGGCCTTGGAACTCAAGGA 


AAGGATGCCCAGGCTGGGAAGGAGC 


AGGCTGGGAAGGAGCCAGGGGGGAG 


GGAGCCAGGGGGGAGCAGGGCTCAC I 


TGGGCTACACTGAGCACCAGGTGGT 


AATATGATGACATCAAGAAGGTGGT I 


ATCCCTGAGCTAGACGGGAAGCTCA 


AACTGTGGCGTGATGGCCGCGGGGC 


GTGTGAACCATGAGAAGTATGACAA | 


ITTCGTCATGGGTGTGAACCATGAGA 
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Figure 2 represents a magnified view of the background feature signal from Figure 
1. In addition, the local background signals from this region of the array are shown, with 
symbols highlighting the signal from the local background surrounding each of the 
background features. As illustrated in Figure 2, the signal of the background features is 
5 often less than the signal of the surrounding local background. As discussed above, it is 
common practice to subtract the local background surrounding a given feature from the 
signal of that feature, in order to obtain background-corrected signal values. If this were 
done with the background features seen in Figure 2, the background-corrected signal 
would be negative for five of the six features. It is also clear from Figure 2 that the signal 
10 from the background features follows the same general trend as the signal from the local 
background (i.e. they are sensitive to local variations in the background signal). 

Example 2 

Utilization of empirically observed inactive probes 
15 The empirically observed background features were tested against other purified 

cRNA labeled targets as well as a labeled complex cRNA pool from human liver. An 
experiment was performed that contained five heterologous targets ("xenogenes") in a 
labeled human liver cRNA pool. The xenogenes used were cab, cor47, sigl, pbpl and 
pbp2 (see Table 2 for gene information; target polynucleotides are the Watson-Crick 
20 complements of the mRNA). The array contained features of the positive control probes 
(SEQ ID NOs: 2 & 3), background probes (SEQ ID NO: 17), and probes that were being 
tested for their specific hybridization to the xenogene targets. 
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The R6G-labeled cRNA xenogene targets were all present at 30 pM. The R6G- 
labeled human liver cRNA was present at 150 ug/ml. Figures 3 shows a scanned image of 
this array. The presence of the complex pool cRNA greatly increases the signal of the 
local background, compared with arrays without complex pool RNA, making the 
5 background features even more visible as "negative features." 

The background features were thus validated to yield minimal signal when 
presented with the five labeled xenogene targets and the labeled complex human liver 
cRNA pool. Other experiments further validated that labeled complex cRNA pools from 
yeast and mouse also yielded minimal signal with the background features. 
10 The appearance of "negative features" is not a requirement for background probes, 

although "negative features" were consistently observed during the initial experiments 
discussed above. Other types of array surfaces or variations in the methods used to 
hybridize or wash the arrays can change the nature of the background signal of the local 
background such that "negative features" are no longer visible. 

15 

Example 3 

Probes forming highly stable intramolecular structures 
Probes were designed to form highly stable intramolecular secondary structures. 
Such structures are well known to the art and include hairpins and pseudo-half knots. It 
20 was hypothesized that such probes would hybridize poorly to any target and would possess 
nonspecific binding properties similar to those of hybridization probes; thus, these probes 
would be good candidates for background probes. 

A. Design of hairpin probes as background probes 

The design of these probes utilized stable base pairing for the stem portion and 
25 utilized variations of the "GAAA" sequence with a C:G clamp for the tetra-loop portion of 
the structure (Antao, V.P. et al., Nucleic Acids Research, 1991, 19:5901-5905). Multiple 
structures were designed and the self-structure properties of the probes were calculated 
using published thermodynamic parameters and known algorithms, such as the "MFOLD" 
method ( Jaeger, J.A. et al., Proc. Natl. Acad. ScL, 1989, 86:7706-7710; and Li, Y. and 
30 W.D. Wilson, Biochemistry, 30:7566-7572). Representative probes are listed in table 3 
and were studied further. 
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Table 3 
Designed hairpin probes 



10 



Probe 
Name 


Sequence ID 
Number 


Sequence 


Predicted 

T m (°Q* 


ST1 


SEQ ID NO: 24 


GCTAGCGAAAGCTAGC 


83 


ST2 


SEQ ID NO: 25 


GCGAGCGAAAGCGAGC 


71 


ST3 


SEQ ID NO: 26 


GCAGGCGAAAGCAGGC 


48 


ST4 


SEQ ID NO: 27 


GCAGGGGAAAGCAGGC 


<30 


ST5 


SEQ ID NO: 28 


GCATACCGAAGCACGC 


<44 



* Melting temperature were predicted according to the methods described by Jaeger, et al. 
and Li, et al. 

15 The self-structure properties of the probes was experimentally verified to form 

highly stable intramolecular duplex structures by measuring the DNA T m as a function of 
oligonucleotide concentration. DNA T m 's were measured using a commercially available 
apparatus, the Perkin-Elmer UV/VTS Spectrometer model Lambda 14. Solutions of test 
oligonucleotides, ranging from 0.2 to 10 optical densities (measured at 260 nm), were 

20 made with 6X SSPE buffer (described above in Example 1). The absorbance at 260 nm 

was followed from 20°C to 95°C, ramping at 0.5 °C/minute. Data analysis was performed 
using the spectrometer' s "Pick Peak" function. 

A high, concentration-independent, duplex melting temperature demonstrates the 
formation of a stable intramolecular secondary structure. For example, a 16-mer DNA 

25 oligonucleotide, ST1 (SEQ ID NO: 24), exhibited a concentration-independent solution 
melting temperature of about 80° C, presumably due to formation of the intramolecular 
secondary structure illustrated below: 



30 



35 



A A 
G A 

C-G 

G-C 

A-T 

T-A 

C-G 

G-C 
5' 3* 



ST1 (SEQ ID NO: 24) 
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B. Testing of designed hairpin probes as background probes 
The designed probes from Table 3 were tested as background probes by designing 
an array that contained features from these probes, along with features of the validated 
G3PDH-570 background probe (SEQ ID NO: 17). R6G-labeled yeast cRNA pool target 
5 was hybridized to this array. Probe synthesis on the array, and conditions used for 
hybridization, washing, and scanning were as described Example 1 above, with the 
exception that the hybridization buffer contained no BSA or SDS. 

Figure 4 illustrates the results of testing designed hairpin probes as background 
probes, and demonstrates that all of the hairpin probes listed in Table 3 (i.e. SEQ ID NO: 
10 24-28) yielded minimal signal, indicating poor hybridization to the labeled yeast RNA pool 
target. The signal of the hairpin probe features was similar to the signal of the validated 
G3PDH-570 background probe features (average signal = 258 counts, SD = 32, 13%CV; 
SEQ ID NO: 17). The background features (i.e. SEQ ID NO: 17 and 24-28) were observed 
as "negative features" in the scanned image, as observed in Figure 3. Table 4 summarizes 
15 the average, inter-feature standard deviation and inter-feature %CV for the five designed 
hairpin probe features (25 replicates in total) compared with the statistics of the local 
backgrounds surrounding these features. 

Table 4 

Signal statistics of hairpin probe features and of local background 

20 





Features 


Local Background 


Average 


319 


934 


SD 


94 


382 


%cv 


30% 


41% 



25 

Table 4 shows the variation of the signal between replicate features of the hairpin 
probe features (e.g. absolute inter-feature SD's and relative inter-feature % CV) was lower 
than variation of signal between the local backgrounds surrounding these features. Thus, 
the hairpin probe features not only yielded lower signal than the surrounding local 
30 backgrounds, but the signal from the hairpin probe features was also more consistent, 
regardless of the location on the array. 
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Example 4 
Short Probes 

The melting temperature of shorter duplexes is dependent upon length, unlike the 
DNA duplexes longer than about 100 base pairs, wherein the melting temperature is 
5 dependent upon strand concentrations and fraction (G+C). The average predicted melting 
temperature for a set of 10,000 random DNA 25-mers with an average fraction (G+C) of 
0.5 is 69° C, while the average predicted melting temperature for a corresponding set of 
DNA 15-mers is 46.9° C (assuming 100 pM target strand and 1M salt). The decrease in 
melting temperature is related to increase in the relative entropic cost of duplex formation 

10 as the strands grow shorter. Thus, it was hypothesized that using shorter probes as 
background probes would result in minimal binding of target. 

Variable lengths of probes were designed in order to determine the shortest length 
that is still effective as background probes. Empirically, the correct length of probe should 
be found such that the nonspecific binding properties of surfaces bearing these short 

15 probes mimics the properties of surfaces bearing long probes (e.g. 25-mers). Such regions 
would possess similar surface energy properties as regions bearing longer probes, but 
would exhibit reduced affinities for their complementary target sequences, when employed 
under conditions that are optimal for longer probes. 

The effect of probe length on background signal was determined in the same array 

20 used to assess the background signal properties of the hairpin probes (i.e. Example 3). 

Probes of length 5, 10, 15, and 20 nucleotides were designed by progressively shortening 
the empirically observed inactive probe G3PDH-570 (SEQ ID NO: 17) from its 5'-end (i.e. 
the end opposite from the site of attachment to the array surface). These shortened probes 
are shown in Table 5. 
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Table 5 

Sequences of Shortened probes and original probe sequence 



Probe 
Length 


Sequence ID 
Number 


Sequence 


25 


SEQ ID NO: 17 


GTGTGAACCATGAGAAGTATGACAA 


20 


SEQ ID NO: 29 


AACCATGAGAAGTATGACAA 


15 


SEQ ID NO: 30 


TGAGAAGTATGACAA 


10 


SEQ ID NO: 31 


AGTATGACAA 


5 


SEQ ID NO: 32 


GACAA 



Figure 5 illustrates the results of a hybridization assay, wherein the shortened 
G3PDH-570 probes were hybridized to R6G-labeled yeast cRNA. The background 
features (i.e. SEQ ID NO: 17 and 29-32) were observed as "negative features" in the 
15 scanned image, as observed in Figure 3. The 20-, 15-, and 10-mer probes (SEQ ID NO: 
29-31) consistently yielded minimal signal. Table 6 summarizes the average, inter-feature 
standard deviation and inter-feature %CV for the two best shortened probes (SEQ ID NO: 
29 and 30), the parent 25-mer G3PDH-570 probe features (SEQ ID NO: 17), and the local 
backgrounds surrounding these features. 

20 

Table 6 

Signal statistics of variable-length probe features and of local background 



SeqID 


#17 


#29 & 30 




Probe Length 


25-mer 


20 & 15-mer 


Local Background 


Average 


258 


297 


694 


SD 


32 


92 


328 


%cv 


13% 


31% 


47% 



30 As discussed in Example 3, the background probe features provide a lower and 

more consistent estimate of background signal than do the local background 
measurements. 
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Example 5 
Chemically Modified Probes 
A background probe can also be produced by the synthesis of oligomeric abasic 
phosphodiesters containing a substituted ribose, or ones where the substituted ribose has 
5 been replaced by another moiety such as a polyether. Such abasic phosphodiesters possess 
polyelectrolyte properties similar to normal DNA and yield a surface with similar 
nonspecific binding properties as surfaces bearing normal DNA strands, but are unable to 
hydrogen bond to target species containing nitrogenous bases. 

Examples of several phosphoramidite monomers useful to construct an abasic 
10 phosphodiester oligomer, using standard chemical synthetic techniques, are shown in 

Figure 6. In structure I, the nitrogenous base in the deoxyribose sugar ring is replaced with 
hydrogen. In structure IE, the deoxyribose sugar ring is replaced by a triethylene glycol 
unit. Analogs of Structure II are obtained by replacing triethylene glycol with 
monoethyleneglycol and hexaethyleneglycol. Compounds of Structures I and II are 
15 commercially available (Glen Research, Sterling, VA). 

Further examples of modified DNA probes include probes comprising reverse 
polarity nucleotide analogs, i.e. probes wherein the deoxyribose sugar-nitrogenous base 
backbone comprises certain nucleotides attached such that they are in opposite polarity as 
compared to the adjacent nucleotides. (See U.S. Patent Nos. 5,399,676; 5,527,899 and 
20 5,721,218 and Koga, M. et al. (1991) J. Org. Chem. 56:3757-3759). 

The methods described in Example 1 above enable the synthesis of background 
probes made with modified nucleotides. The use of such background probes in 
hybridization arrays permits an accurate estimation of the background signal of hybridized 
array features. Additionally, such estimation of the background allows for a lower limit of 
25 detection (LLD) for the desired target molecule. 

Example 6 

Use of background features in background-correcting 
the signal of hybridization features 
30 The array shown in Example 2 (Figure 3) was hybridized with the R6G-labeled 

xenogene cRNA's present at 30 pM and was part of a series of arrays that were hybridized 
with variable concentrations of the xenogene targets. The purpose of the series of arrays 
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was to determine the LLD of specific probes against the five xenogene targets. This series 
of experiments is referred to as the xenogene titration experiment. In order to determine 
LLD's, one must first background-correct the total signal observed in each feature. The 
effect of using background features vs. local background for background-correction is 
5 demonstrated in this example. The example uses the hybridization data of the Re- 
labeled cor47 and pbp2 cRNA targets with their respective specific probes, cor47-181 
(SEQ ID NO: 33) and pbp2-867 (SEQ ID NO: 34): 
SEQ ID NO: 33 

AGGAGAACAAGATTACTCTGCTAGA 
10 SEQ ID NO: 34 

TTCGTTTCCCCATCTGGCTGGATGA 

The probes cor47-181 (SEQ ID NO: 33) and pbp2-867 (SEQ ID NO: 34) were each 
present in four replicate features on the array. There were 17 replicates of background 
1 5 features (N = 9 replicates of G3PDH-570; SEQ ID NO 17 and N = 8 replicates of ST 1 ; 
SEQ ID NO: 24). The array had 200 features total and the regions around each of these 
features comprised the 200 regions of local background. Table 7 shows the statistics of the 
background features and local backgrounds. 

20 Table 7 

Signal statistics of background features and of local background 





Background Features 


Local Background 
of All Features 


N 


17 


200 


Minimum 


13 


35 


Average 


20 


108 


SD 


8 


123 


%cv 


41% 


114% 


3 *SD 


24 


369 



30 

The average signal from the background features is much lower than the average of 
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the local background (i.e. 20 < 108). Figure 7 shows a scanned image demonstrating a 
typical region of local background and background features. This image is a section of the 
array illustrated in Figure 3. Replicate features are shown for the PR025G positive control 
oligonucleotide (SEQ ID NO: 2), the background probe ST1 (SEQ ED NO: 24), as well as 
5 two replicates of the pbp2-867 probe (SEQ ED NO: 34). The signals from the pbp2-867 
(SEQ ID NO:34) features (i.e. features #1 12 and #118) are of similar strength as some 
portions of the local background signal, but they are stronger than the signal from the 
background features. 

The quantitative signal data for the array region around pbp2-867 feature #118 

10 (SEQ ID NO: 34) is shown in Figure 8 (area of magnification indicated in Figure 3). The 
positive control features (SEQ ED NO: 2; average signal = 4,280) have been omitted from 
this figure in order to magnify the signal scale for the features of interest. Figure 8 shows 
that the signals from the local backgrounds are greater than the two background features, 
#1 15 (ST1; SEQ ED NO: 24) and #131 (G3PDH-570; SEQ ED NO: 17). The signal from 

15 feature #118 (pbp2-867 probe; SEQ ED NO: 34) is greater than the local background, but it 
is even greater than the background features. Thus, the background-corrected signal for 
feature #1 18 (SEQ ED NO: 34) is greater when corrected with background features (net 
signal = 123; Table 8) than when corrected with local background (net signal = 69; Table 
8). 
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Another common method of background-correction is to use the minimum of all 
local backgrounds, referred to in these examples as the global background (signal = 35; 
Table 7). The use of this global background to background-correct feature #118 (SEQ ID 
NO: 34) yields a signal of 108 (Table 8). Table 8 also shows the different background- 
5 corrected values that are obtained for the other three replicate features of probe pbp2-867 
(SEQ ID NO: 34). With all four of the pbp2-867 (SEQ ID NO: 34) features, the use of 
background features yields the highest background-corrected signals. 

The above analysis was performed on all 158 hybridization features of this array. 
The results were the same as observed with the four features of the pbp2-867 (SEQ ID NO: 
10 34) probe. Thus, the use of background features, as opposed to local background or global 
background, yielded the highest signal values of all background-corrected feature signals. 
However, in order to establish background features as the best background-correction 
method, the background-corrected signals need to be accurate, as well as yield higher 
signal values. 

15 One measure of the accuracy of a background correction method is the degree with 

which the method compensates for variations in the background signal. The use of local 
backgrounds for background correction of features assumes that the different signal values 
detected at the different local background regions reflects the actual background signal in 
their respective neighboring features. This hypothesis was tested by comparing the 

20 reproducibility of signals among replicate features of the same hybridization probe. If the 
above assumption of local background correction is correct, the reproducibility of signals 
among replicate features should be better when corrected with the local background 
method than when uncorrected or when background-corrected with a constant value of 
background. However, this was not the case in the probes studied. 

25 For instance, with the four replicates of the pbp2-867 probes (SEQ ID NO: 34), the 

inter-feature variation (calculated as the standard deviation (SD) among the four feature 
signal values) was 48 when corrected with local background, as opposed to 44 with no 
background-correction or with a constant value of background-correction (see Table 8). 
This same analysis was performed with the inter-feature standard deviations of the four 

30 replicates from each of the 42 xenogene hybridization probes. Figure 9 shows that the 

inter-feature standard deviations (inter-feature SD) were increased when the feature signals 
were corrected with the local background method, as opposed to no background-correction 
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(i.e. SD values above the 1:1 correlation line). The increase in inter-feature SD was 62 
signal counts on average, and as high as 362 signal counts. There were three exceptions to 
this observation. In the three cases where the use of local background correction decreased 
the inter-feature SD, the signal values decreased by less than 6 counts. 
5 These results illustrate that background-correction using local background 

generally made the inter-feature statistics worse, indicating that the background signal 
reported in a given local background region did not track the actual background signal 
within its neighboring feature. Background correction with a constant value yielded more 
accurate feature signals. 

10 The same conclusion was arrived at by studying the average inter-feature %CV 

metric ([inter-feature standard deviation of replicates/inter-feature mean of replicates] x 
100%). This metric used signals before background correction in order to determine the 
magnitude of variability among replicate feature signals, and to determine which 
background method most closely modeled that variation. The inter-feature %CV of the 

15 four replicates from each of the 42 xenogene hybridization probes was calculated and 

yielded an average of 25%CV and a maximum of 47%CV. This variation among replicate 
features is more closely modeled by the inter-feature variation of the background features 
than by the variation of local backgrounds (i.e. 41% CV and 1 14% CV, respectively; Table 
7). Thus, the local background method samples a more variable source of background 

20 signal, as well as a higher level of background signal, than the background feature method. 
Studies comparing the two constant type background-correction methods (i.e. 
global background and background features) are described below. In order to determine 
the accuracy of global background vs. background features, features with low signal values 
from five of the arrays of the xenotitration experiment (i.e. those with target concentrations 

25 ranging from 3 pM to 3,000 pM) were studied. 

Low signal values were obtained with some probes that poorly hybridized with 
their correct target at all but the highest target concentration. Other probes yielded low 
signals only at the lowest target concentrations. An example of a poorly hybridizing probe, 
cor47-181 (SEQ ID NO: 33), is shown in Figure 10. The mean of the uncorrected replicate 

30 cor47-181 (SEQ ID NO: 33) feature signals and lx SD error bars (i.e., inter-feature 

standard deviation) are illustrated for five target concentrations. The average background 
feature signal and global background signal from each of the arrays is also plotted. At a 
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target concentration of 300 pM (log concentration = 2.5), the signals from the replicate 
features of probe cor47-181 (SEQ ID NO: 33) were greater than the signal of the 
background features, but less than the signal of the global background. At the target 
concentrations of 30 pM and 100 pM (log concentration =1.5 and 2.0), the signals of the 
5 cor47-181 (SEQ ID NO: 33) probes were within the distribution of the background 

features and, again, lower that the global background signal. At the target concentrations 
of 3 pM (i.e. log concentration =1.5) and 0 pM (not shown on the log plot), the signals of 
the cor47-181 (SEQ ID NO: 33) probes were within the distribution of the background 
features and within 4 signal counts of the global background signal. 

10 Figure 10 demonstrates that the average signal from the background features 

closely models the low signals observed from the cor47-181 (SEQ ID NO: 33) features. In 
contrast, the global background overestimates the background signal for three arrays (at 30, 
100 and 300 pM target concentrations) and correction with the global background yields 
negative net signals for these probes. The local background signals surrounding each of 

15 the cor47-181 (SEQ ID NO: 33) probe features are not shown in Figure 10, but these 
signals are even greater than the global background signal, since the global background 
signal used from each array is the minimum of all local background signals from that array. 

A similar analysis was performed with the inter-feature means of all 42 
xenotitration hybridization probes from the 5 arrays (target concentrations from 3 pM to 

20 3,000 pM). Of the 210 total inter-feature means, 190 inter-feature means were greater than 
either the global background or the background feature significant limit of their respective 
arrays. The signal value used for the background feature significant limit of each array is 
the sum of the background features mean and [1 x Background_SD (b-SD)], where the 
Background_SD (b-SD) is defined as the inter-feature standard deviation among the 

25 background features of that array. This metric is used to determine the significance of 
feature signal values to be used in LLD calculations, as discussed below in Example 7. 
The signal characteristics of the lowest 20 inter-feature means are shown in Table 9: 
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Table 9 

Xenotitration hybridization probes 
Distribution of lowest signal inter-feature means 



Inter-feature means 


N 


Characterized as 


> Background Feature (Mean + 
lx SD) & < Global Background 


13 


Significant using background 
features, not significant using 
global background 


Within Background Feature 
(Mean +/- lx SD) & < Global 
Background 


4 


Similar distribution as 
background 


Within Background Feature 
(Mean +/- lx SD) & <= Global 
Background 


3 


Similar distribution as 
background features and 
global background 



15 The use of background features for background-correction yields thirteen probes 

with significant signal and four probes which were of the same distribution as the 
background features. Global background overestimated the background signal for these 
seventeen probes. Additional evidence for the accuracy of the background features vs. 
global background (in context of LLD calculations) is described in Example 7, infra. 

20 

Example 7 

Use of background features in determinations of LLD 
This example demonstrates the impact that different background-correction 

25 methods have on the calculation of Lower Limit of Detection (LLD). As demonstrated in 
Example 6, the background signal values using background features are generally lower 
than the signals using either the local or global background methods. Additionally, the 
variation of signal among background feature replicates is much less than the variation 
among local background regions (see, e.g., Tables 4, 6 and 7). 

30 The variation of background signal (i.e. the noise of the background signal) is 

important since it is used in calculations to determine whether a feature is "significant". A 
standard deviation (1 x b-SD) of the background, used as the noise, must be exceeded for a 
background-corrected feature to be significant. For the background feature method and the 



-48- 



Attorney Docket No. 10981620-1 



global background method, the noise metric is the inter-feature standard deviation (e.g. 
signal = 8 vs. 123, respectively; Table 7). Since the local background method uses only 
one signal value per local background region to background-correct its respective 
neighboring feature, there is no inter-feature standard deviation. The noise metric for local 
5 background is defined as the intra-feature standard deviation (pixel signal variation) of the 
local background region. 

As discussed above, features with low observed signals may or may not exceed 
significant limits, depending upon the background-correction method used (e.g. Table 9). 
A graphical representation of the analysis is illustrated in Figure 11, using the same array 

10 and feature region used for Figure 8. In order to compare the differences among the three 
background methods on the same figure, the background-uncorrected signals of 
hybridization features are used. The background-uncorrected signal is significant if it is 
greater than the sum of the estimated background signal and (1 x b-SD) for each of the 
three background-correction methods. In addition to illustrating the signal from 

15 hybridization features, the significance limits for each of the three background-correction 
methods are also demonstrated in Figure 11. With the background feature method, 12 of 
the 13 hybridization features were significant. In contrast, only 2 or 4 hybridization 
features were significant with the global background method or local background method, 
respectively. The analyses of Figure 1 1 and Table 9 demonstrate that the use of 

20 background features, as compared to local background or global background, permits more 
features to be identified as significant, and thus to be used in LLD calculations. 

The following section describes the algorithm used for LLD calculations. All 
hybridization features which pass the background significance test, described above, are 
averaged for a given probe for each array. A plot of log(background-corrected signal) vs. 

25 log(target concentration) is plotted for each probe. If a probe has not passed the 

significance test at a given target concentration, the background significance signal value 
(1 x b-SD) is used for the "y" value at that array concentration. A linear regression 
calculation is performed on the linear portion of the data. The LLD concentration is 
calculated at the point where the best fit linear regression line crosses a threshold. The 

30 threshold for these LLD calculations is calculated as (3 x average b-SD). The b-SD values 
are calculated for each array, as described above. The average of the five array values is 
multiplied by three for the LLD threshold (3 x average b-SD). An example of an LLD 
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calculation is shown in Figure 12 for the probe, pbpl-203 (SEQ ID NO: 35): 
SEQ ID NO: 35 

GGTTATTTCCGGTGGCAGCACGCTC 



5 Figure 12 compares the background feature method with the global background 

method. The comparison does not include the local background method, since the data 
presented in Example 6, especially Figure 9, demonstrated the deficiencies of this method. 
For purposes of demonstrating both background-correction methods on the same figure, 
the non-background-corrected feature signals are used. The equivalent LLD thresholds 

10 shown are thus the sums of background signal and (3 x average b-SD). 

As explained above, the global background method uses the minimum of all local 
background signals (signal = 35, from Table 7 for the 30 pM target array) as the estimated 
background signal for all features on that array and uses three times the SD among the 
local backgrounds (e.g. 3 x b-SD signal = 369, from Table 7 for the 30 pM target array) as 

15 the noise component for that array. The threshold value (3 x average b-SD) for the five 
xenotitration arrays is 138. Since non-background-corrected data is used in Figure 12, the 
global background threshold plotted for each array is the sum of (the global background 
signal of that array + 138). The average of these five sums is the threshold used to 
calculate the LLD, shown as the "average global background threshold" line in Figure 12. 

20 The LLD calculated for the pbpl-203 probe (SEQ ID NO:35) using the global background 
method is 16.7 pM. 

As explained above, the background feature method uses the average of all 
background features of an array as the estimated background signal for all features on that 
array (e.g. signal = 20, from Table 7 for the 30 pM target array). The background features 

25 3 x interfeature_SD is calculated (e.g. 3 x b-SD signal = 24, from Table 7 for the 30 pM 

target array) as the noise component for that array. The threshold value (3 x average b-SD) 
for the five xenotitration arrays is 15. Since non-background-corrected data is used in 
Figure 12, the background feature threshold plotted for each array is the sum of (the 
average background feature signal of that array +15). The average of these five sums is 

30 the threshold used to calculate the LLD, shown as the "average background feature 

threshold" line in Figure 12. The LLD calculated for the pbpl-203 probe (SEQ ID NO:35) 
using the background feature method is 0.9 pM, 18-fold lower than that calculated with the 
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global background method. 

LLD values were calculated with both background-correction methods for all of the 
xenogene probes studied in the xenotitration experiment. Figure 13 shows a plot 
comparing the LLD's calculated with global background vs. background features. The 

5 data includes 42 probes, each having 3 to 4 features replicates per array, that were 

hybridized with five concentrations of their specific targets (i.e. 3 pM to 3,000 pM). It is 
clear from Figure 13 that the use of background features for background signal estimation 
and threshold determination yields much lower limits of detection than the use of global 
background (i.e LLD values below the 1: 1 correlation line). The average decrease in LLD 

10 from using global background to using background features is 30-fold, a very large 
increase in assay sensitivity. 

Thus, methods of determining the presence and/or amount of a target nucleotide 
sequence in an analyte by determining the extent of hybridization of a labeled probe to a 
15 target nucleotide sequence is disclosed. Although preferred embodiments of the invention 
device have been described in some detail, it is understood that obvious variations can be 
made without departing from the spirit and scope of the invention as defined by the 
appended claims. 
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CLAIMS 

We claim: 

5 1. A set of features comprising oligophosphodiester probes, wherein said 

features comprise 

(a) hybridization features comprising hybridization probes that selectively 
hybridize to a detectably labeled target nucleotide sequence, and 

(b) background features comprising background probes that do not selectively 
10 hybridize to said target nucleotide sequence. 

2. The set of claim 1, wherein said hybridization probes and said background 
probes are bound to a surface or are present in solution. 

15 3. The set of claim 2 wherein said surface is an array surface. 

4. The set of claim 1, wherein the background probes comprise empirically 
observed inactive probes, probes forming stable intramolecular structures, short probes, 
reverse polarity nucleotide analogs, abasic phosphodiesters or modified nucleotidic units. 

20 

5. The set of claim 4, wherein the background probes comprise empirically 
observed inactive probes selected from the group consisting of 

CAGAGGAAGAGAATCTCCGCAAGAA (SEQ ID NO: 5); 

GAATCTCCGCAAGAAAGGGGAGCCT (SEQ ID NO: 6); 
25 CGAGCTGCCCCCAGGGAGCACTAAG (SEQ ID NO: 7); 

CCAGGGAGCACTAAGCGAGCACTGC (SEQ ID NO: 8); 

TGAATGAGGCCTTGGAACTCAAGGA (SEQ ID NO: 9); 

AAGGATGCCCAGGCTGGGAAGGAGC (SEQ ID NO: 10); 

AGGCTGGGAAGGAGCCAGGGGGGAG (SEQ ID NO: 11); 
30 GGAGCCAGGGGGGAGCAGGGCTCAC (SEQ ID NO: 12); 

TGGGCTACACTGAGCACCAGGTGGT (SEQ ID NO: 13); 

AATATGATGACATCAAGAAGGTGGT (SEQ ED NO: 14); 
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ATCCCTGAGCTAGACGGGAAGCTCA (SEQ ID NO: 15); 
AACTGTGGCGTGATGGCCGCGGGGC (SEQ ID NO: 16); 
GTGTGAACCATGAGAAGTATGACAA (SEQ ID NO: 17); and 
TTCGTCATGGGTGTGAACCATGAGA (SEQ ID NO: 18). 

5 

6. The set of claim 4, wherein the background probes comprise probes 
forming stable intramolecular structures, wherein the probes are selected from the group 
consisting of 

GCTAGCGAAAGCTAGC (SEQ ID NO: 24); 
10 GCGAGCGAAAGCGAGC (SEQ ID NO: 25); 

GCAGGCGAAAGCAGGC (SEQ ID NO: 26); 

GCAGGGGAAAGCAGGC (SEQ ID NO: 27); and 

GCATACCGAAGCACGC (SEQ ID NO: 28). 

15 7. The set of claim 4, wherein the background probes comprise short probes, 

wherein the probes are selected from the group consisting of 

AACCATGAGAAGTATGACAA (SEQ ID NO: 29); 

TGAGAAGTATGACAA (SEQ ID NO: 30); 

AGTATGACAA (SEQ ID NO: 3 1); and 
20 GACAA (SEQ ID NO: 32). 

8. The set of claim 4, wherein the background probes comprise reverse 
polarity nucleotide analogs. 

25 9. The set of claim 4, wherein the background probes comprise abasic 

phosphodiesters or modified nucleotidic units. 

10. A method of detecting the presence and/or amount of a target nucleotide 
sequence in an analyte, said method comprising: 
30 (a) providing an analyte suspected of containing the target nucleotide sequence; 

(b) contacting an aliquot of said analyte suspected of containing said target 
nucleotide sequence with a set of features comprising oligophosphodiester probes, wherein 
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said target nucleotide sequence is labeled with a detectable label capable of generating a 
measurable signal, and further wherein said features comprise: 

(i) hybridization features comprising hybridization probes that selectively 
hybridize to said labeled target nucleotide sequence, and 
5 (ii) background features comprising background probes that do not selectively 

hybridize to said labeled target nucleotide sequence; 
(c) detecting an observed signal, wherein the observed signal is an amount of 
signal generated from contacting the target nucleotide sequence with said features 
comprising oligophosphodiester probes; 
10 (d) detecting a background signal, wherein said background signal is an amount 

of signal generated from said background features; 

(e) subtracting the background signal from the observed signal to determine the 
presence and/or amount of said target nucleotide sequence in said analyte. 

15 11. The method of claim 10 wherein said hybridization probes and said 

background probes are bound to an array surface. 

12. The method of claim 10, wherein the wherein said target nucleotide 
sequence is directly labeled with said detectable label. 

20 

13. The method of claim 10, wherein the wherein said target nucleotide 
sequence is indirectly labeled with said detectable label. 

14. The method of claim 10, wherein said signal is detected by colorimetric, 
25 fluorimetric, chemiluminescent or bioluminescent techniques. 

15. The method of claim 10, wherein the background probes comprise 
empirically observed inactive probes, probes forming stable intramolecular structures, 
short probes, reverse polarity nucleotide analogs, abasic phosphodiesters or modified 

30 nucleotidic units. 

16. The method of claim 15, wherein the background probes comprise 
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empirically observed inactive probes selected from the group consisting of 

CAGAGGAAGAGAATCTCCGCAAGAA (SEQ ID NO: 5); 

GAATCTCCGCAAGAAAGGGGAGCCT (SEQ ID NO: 6); 

CGAGCTGCCCCCAGGGAGCACTAAG (SEQ ID NO: 7); 
5 CCAGGGAGCACTAAGCGAGCACTGC (SEQ ID NO: 8); 

TGAATGAGGCCTTGGAACTCAAGGA (SEQ ID NO: 9); 

AAGGATGCCCAGGCTGGGAAGGAGC (SEQ ID NO: 10); 

AGGCTGGGAAGGAGCCAGGGGGGAG (SEQ ID NO: 1 1); 

GGAGCCAGGGGGGAGCAGGGCTCAC (SEQ ID NO: 12); 
10 TGGGCTACACTGAGCACCAGGTGGT (SEQ ID NO: 13); 

AATATGATGACATCAAGAAGGTGGT (SEQ ID NO: 14); 

ATCCCTGAGCTAGACGGGAAGCTCA (SEQ ID NO: 15); 

AACTGTGGCGTGATGGCCGCGGGGC (SEQ ID NO: 16); 

GTGTGAACCATGAGAAGTATGACAA (SEQ ID NO: 17); and 
1 5 TTCGTCATGGGTGTGAACCATGAGA (SEQ ID NO: 1 8). 



17. The method of claim 15, wherein the background probes comprise probes 
forming stable intramolecular structures, wherein the probes are selected from the group 
consisting of 

20 GCTAGCGAAAGCTAGC (SEQ ID NO: 24); 

GCGAGCGAAAGCGAGC (SEQ ID NO: 25); 
GCAGGCGAAAGCAGGC (SEQ ID NO: 26); 
GCAGGGGAAAGCAGGC (SEQ ID NO: 27); and 
GCATACCGAAGCACGC (SEQ ID NO: 28). 

25 

18. The set of claim 15, wherein the background probes comprise short probes, 
wherein the probes are selected from the group consisting of 

AACCATGAGAAGTATGACAA (SEQ ID NO: 29); 
TGAGAAGTATGACAA (SEQ ID NO: 30); 
30 AGTATGACAA (SEQ ID NO: 3 1); and 

GACAA (SEQ ID NO: 32). 
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19. The method of claim 15, wherein the background probes comprise reverse 
polarity nucleotide analogs. 

20. The method of claim 15, wherein the background probes comprise abasic 
5 phosphodiesters or modified nucleotidic units. 

21. A method for estimating background noise in a nucleic acid hybridization 
assay, said method comprising providing a set of features comprising oligophosphodiester 
probes, wherein said features comprise hybridization features comprising hybridization 

0 probes that selectively hybridize to a target nucleotide sequence, and background features 
comprising background probes that do not selectively hybridize to said target nucleotide 
sequence; and subtracting the background signal, wherein said background signal is an 
amount of signal generated from said background features, from the observed signal, 
wherein the observed signal is an amount of signal generated from contacting the target 

5 nucleotide sequence with said features comprising oligophosphodiester probes. 

22. The method of claim 21 wherein said hybridization probes and said 
background probes are bound to an array surface. 

0 23. The method of claim 21, wherein said signal is detected by colorimetric, 

fluorimetric, chemiluminescent or bioluminescent techniques. 



24. The method of claim 2 1 , wherein the background probes comprise 
empirically observed inactive probes, probes forming stable intramolecular structures, 
short probes, reverse polarity nucleotide analogs, abasic phosphodiesters or modified 
nucleotidic units. 

25. The method of claim 24, wherein the background probes comprise 
empirically observed inactive probes selected from the group consisting of 

CAGAGGAAGAGAATCTCCGCAAGAA (SEQ ID NO: 5); 
GAATCTCCGCAAGAAAGGGGAGCCT (SEQ ID NO: 6); 
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CGAGCTGCCCCCAGGGAGCACTAAG (SEQ ID NO: 7); 
CCAGGGAGCACTAAGCGAGCACTGC (SEQ ID NO: 8); 
TGAATGAGGCCTTGGAACTCAAGGA (SEQ ID NO: 9); 
AAGGATGCCCAGGCTGGGAAGGAGC (SEQ ID NO: 10); 
5 AGGCTGGGAAGGAGCCAGGGGGGAG (SEQ ID NO: 1 1); 

GGAGCCAGGGGGGAGCAGGGCTCAC (SEQ ID NO: 12); 
TGGGCTACACTGAGCACCAGGTGGT (SEQ ID NO: 13); 
AATATGATGACATCAAGAAGGTGGT (SEQ ID NO: 14); 
ATCCCTGAGCTAGACGGGAAGCTCA (SEQ ID NO: 15); 
10 AACTGTGGCGTGATGGCCGCGGGGC (SEQ ID NO: 16); 

GTGTGAACCATGAGAAGTATGACAA (SEQ ID NO: 17); and 
TTCGTCATGGGTGTGAACCATGAGA (SEQ ID NO: 18). 

26. The method of claim 24, wherein the background probes comprise probes 
15 forming stable intramolecular structures, wherein the probes are selected from the group 

consisting of 

GCTAGCGAAAGCTAGC (SEQ ID NO: 24); 
GCGAGCGAAAGCGAGC (SEQ ID NO: 25); 
GCAGGCGAAAGCAGGC (SEQ ID NO: 26); 
20 GCAGGGGAAAGCAGGC (SEQ ID NO: 27); and 

GCATACCGAAGCACGC (SEQ ID NO: 28). 

27. The set of claim 24, wherein the background probes comprise short probes, 
wherein the probes are selected from the group consisting of 

25 AACCATGAGAAGTATGACAA (SEQ ID NO: 29); 

TGAGAAGTATGACAA (SEQ ID NO: 30); 
AGTATGACAA (SEQ ID NO: 31); and 
GACAA (SEQ ID NO: 32). 

30 28. The method of claim 24, wherein the background probes comprise reverse 

polarity nucleotide analogs. 
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29. The method of claim 24, wherein the background probes comprise abasic 
phosphodiesters or modified nucleotidic units. 

30. A method of validating a test-background feature comprising test- 
background probes, said method comprising: 

(a) providing an analyte containing a target nucleotide sequence; 

(b) contacting an aliquot of said analyte containing said target nucleotide 
sequence with a set of features comprising oligophosphodiester probes, wherein said target 
nucleotide sequence is labeled with a detectable label capable of generating a measurable 
signal, and further wherein said features comprise 

(i) hybridization features comprising hybridization probes that selectively 

hybridize to a target nucleotide sequence, 

(ii) test-background features comprising test-background probes that do not 

selectively hybridize to said target nucleotide sequence, and 

(iii) standard-background features comprising standard-background probes that do 

not selectively hybridize to said target nucleotide sequence; 

(c) detecting an observed signal, wherein the observed signal is an amount of 
signal generated from contacting the target nucleotide sequence with said features 
comprising oligophosphodiester probes; 

(d) detecting a test-background signal, wherein said test-background signal is 
an amount of signal generated from said test-background features; 

(e) detecting a standard-background signal, wherein said standard-background 
signal is an amount of signal generated from said standard-background features; 

(f) comparing the amount of the test-background signal with the amount of the 
standard-background signal. 

3 1 . The method of claim 30, wherein the wherein said target nucleotide 
sequence is directly labeled with said detectable label. 

32. The method of claim 30, wherein the wherein said target nucleotide 
sequence is indirectly labeled with said detectable label. 
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33. A test kit for detecting the presence and/or amount of a target nucleotide 
sequence in an analyte, said kit comprising a container containing an array of features 
comprising oligophosphodiester probes, wherein said features comprise hybridization 
features comprising hybridization probes that selectively hybridize to a target nucleotide 
sequence, and background features comprising background probes that do not selectively 
hybridize to said target nucleotide sequence. 
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TECHNIQUES FOR ASSESSING NONSPECIFIC BINDING 
OF NUCLEIC ACIDS TO SURFACES 



ABSTRACT 



Methods for substantially improved detection and analysis in nucleic acid 
hybridization assays are described. The methods provide the reliable estimation of 
background signal which derives primarily from nonspecific hybridization. The invention 
is useful in chemical, biological, medical and diagnostic techniques, as well as for drug 
discovery. 
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DECLARATION AND POWER OF ATTORNEY 

FOR PATENT APPLICATION 






m ' ' 



As a below named inventor, I hereby declare that: 

My residence/post office address and citizenship are as stated below next to my name; 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and 
joint inventor {if plural names are listed below) of the subject matter which is claimed and for which a 
patent is sought on the invention entitled: 

TECHNIQUES FOR ASSESSING NONSPECIFIC BINDING OF NUCLEIC ACIDS TO SURFACES 



the specification of which is attached hereto unless the following box is checked: 



{ ) was filed on 
Number 



as US Application Serial No. or PCT International Application 

and was amended on {if applicable). 



I hereby state that I have reviewed and understood the contents of the above-identified specification, 
including the claims, as amended by any amendment(s) referred to above. I acknowledge the duty to 
disclose all information which is material to patentability as defined in 37 CFR 1.56. 

Foreign Application(s) and/or Claim of Foreign Priority 

I hereby claim foreign priority benefits under Title 35, United States Code Section 119 of any foreign application(s) for patent or 
inventor(s) certificate listed below and have also identified below any foreign application for patent or inventor(s) certificate having 
a filing date before that of the application on which priority is claimed: 



COUNTRY 


APPLICATION NUMBER 


DATE FILED 


PRIORITY CLAIMED UNDER 35 U.S.C. 119 








YES: NO: 








YES: NO: 



I hereby claim the benefit under Title 35, United States Code Section 119(e) of any United States provisional application(s) listed 
below: 



APPLICATION SERIAL NUMBER 



FILING DATE 



: U. S. Priority Claim 

I hereby claim the benefit under Title 35, United States Code, Section 120 of any United States application(s) listed below and, 
insofar as the subject matter of each of the claims of this application is not disclosed in the prior United States application in the 
; manner provided by the first paragraph of Title 35, United States Code Section 112, I acknowledge the duty to disclose material 
[information as defined in Title 37, Code of Federal Regulations, Section 1.56(a) which occurred between the filing date of the prior 
| application and the national or PCT international filing date of this application: 



APPLICATION SERIAL NUMBER 


FILING DATE 


STATUS (patented/pending/abandoned) 





















POWER OF ATTORNEY: 

As a named inventor, I hereby appoint the following attorney(s) and/or agent(s) listed below to prosecute this application and 
transact all business in the Patent and Trademark Office connected therewith. 



Bill Kennedy 
Reg. No. 33,407 



Gordon Stewart 
Reg. No. 30,528 



Philip S. Yip 
Reg. No. 37,265 



Herbert R. Schulze 
Reg. No. 30,682 



Send Correspondence to: 
IP Adminibtrahon 
Li-jnl Department. 20BN 
HEWLtTT-PACKARD COMPANY 
P O Box 10301 

Palo Atto, Californiaj§43«3rg89( 



Direct Telephony Call 



s To: 




I hereby declare that all statements made herein of my own knowledge are true and that all statements 
made on information and belief are believed to be true; and further that these statements were made 
with the knowledge that willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful 
false statements may jeopardize the validity of the application or any patent issued thereon. 

Full Name of Inventor: Glenda C. Delenstarr Citizenship: [)$ 

Residence: 1512 Pine Knoll Drive, Belmont, CA 94002 

Post Office Address: Same as residence 



Inventor s Signature 7^ 



Date 



Rev 06/99 (DecPwr) 
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Full Name of # 2 joint inventor: Steven M. LefkowitZ Citizenshi p: US 

Residence: 1308 Vista Grande, Millbrae, CA 94030 



Post Office Address: Same as residence 



Inventor s Signature Date 



Full Name of # 3 joint inventor: Kevin J. Luebke Citizenship: US 



Residence: 5821 Goliad Avenue, Dallas, TX 75206 



Post Office Address: Same as residence 



Inventor s Signature Date 



Full Name of # 4 joint inventor: Leslie B. Overman Citizenship: US 



Residence: 4751 Mt. Harris Drive, San Diego, CA 92117 

Same as residence 



Post Office Address: 



inventor s Signature Date 



Full Name of # 5 joint inventor: Nicholas M. Sampas Citizenship: US 



Residence: 1418 Cherry Avenue, San Jose, CA 95125 



3 Post Office Address: Same as residence 



- inventor s Signature \P 17 J/ KZTZ ' r- 



Date 



Full Name of # 6 joint inventor: Jeffery R. Sampson Citizenship: US 
Residence: 1228 Drake Avenue, Burlingame, CA 94010 




Date 



Full Name of # 7 joint inventor: Paul K. Wolber Citizenship: US 



Residence: 531 Torwood Lane, Los Altos, CA 94022 



Post Offise Address: Same as residence 




Date 



Full Name of # 8 joint inventor: Citizenship: 

Residence: 

Post Office Address: 



inventor s signature Date 
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